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NEAR-CRITICAL SIR EPIDEMIC ON A RANDOM GRAPH WITH 

GIVEN DEGREES 


SVANTE JANSON, MALWINA LUCZAK, PETER WINDRIDGE, AND THOMAS HOUSE 

Abstract. Emergence of new diseases and elimination of existing diseases is a key public 
health issue. In mathematical models of epidemics, such phenomena involve the process of 
infections and recoveries passing through a critical threshold where the basic reproductive 
ratio is 1. 

In this paper, we study near-critical behaviour in the context of a susceptible-infective- 
recovered (SIR) epidemic on a random (multi)graph on n vertices with a given degree 
sequence. We concentrate on the regime just above the threshold for the emergence of 
a large epidemic, where the basic reproductive ratio is 1 + ^(njrU 1 / 3 , with ui{ri) tending 
to infinity slowly as the population size, n, tends to infinity. We determine the probability 
that a large epidemic occurs, and the size of a large epidemic. 

Our results require basic regularity conditions on the degree sequences, and the assump¬ 
tion that the third moment of the degree of a random susceptible vertex stays uniformly 
bounded as n —> oo. As a corollary, we determine the probability and size of a large near- 
critical epidemic on a standard binomial random graph in the ‘sparse’ regime, where the 
average degree is constant. As a further consequence of our method, we obtain an improved 
result on the size of the giant component in a random graph with given degrees just above 
the critical window, proving a conjecture by Janson and Luczak. 


1. Introduction 

Infectious diseases continue to pose a serious threat to individual and public health. Ac¬ 
cordingly, health organisations are constantly seeking to analyse and assess events that may 
present new challenges. These may include acts of bioterrorism, and other events indicating 
emergence of new infections, which threaten to spread rapidly across the globe facilitated 
by the efficiency of modern transportation. Likewise, a lot of effort is being directed into 
suppressing outbreaks of established diseases such as influenza and measles, as well as into 
eliminating certain endemic diseases, such as polio and rabies. 

In an SIR epidemic model, an infectious disease spreads through a population where each 
individual is either susceptible, infective or recovered. The population is represented by a 
network (graph) of contacts, where the vertices of the network correspond to individuals 
and the edges correspond to potential infectious contacts. Different individuals will have 
different lifestyles and patterns of activity, leading to different numbers of contacts; for 
simplicity, we assume that each person’s contacts are randomly chosen from among the rest 
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of the population. The degree of a vertex is the number of contacts of the corresponding 
individual. 

We assume that infectious individuals become recovered at rate p ^ 0 and infect each 
neighbour at rate f3 > 0. Then the basic reproductive ratio (i.e. the average number of 
secondary cases of infection arising from a single case) is given by the average size-biased 
susceptible degree times the probability that a given infectious contact takes place before 
the infective individual recovers. 

Emergence and elimination of a disease involves the process of infectious transitions and 
recoveries being pushed across a critical threshold, usually corresponding to the basic re¬ 
productive ratio IZq equal to 1, see BQ (HI ; [ 33 !]. For example, a pathogen mutation can 
increase the transmission rate and make a previously ‘subcriticah disease (i.e. not infectious 
enough to cause a large outbreak) into a ‘supercritical’ one, where a large outbreak may 
occur, see jl]. Moreover, after a major outbreak in the supercritical case, disease in the 
surviving population is subcriticah However, subsequently, as people die and new individu¬ 
als are born (i.e. immunity wanes), will slowly increase, and, when it passes 1, another 
major outbreak may occur. Equally, efforts at disease control may result in subcriticality 
for a time, but then inattention may lead to an unnoticed parameter shift to supercritical¬ 
ity. Thus, under certain conditions, one can expect most large outbreaks to occur close to 
criticality, and so there is practical interest in theoretical understanding of the behaviour of 
near-critical epidemics. 

Critical SIR epidemics have been studied for populations with complete mixing, under 


different assumptions, by 13; 1 3; 24]; this is equivalent to studying epidemic processes on 
the complete graph, or on the Erdos-Renyi graph G(n,p). In 0], near-criticality is discussed 


using non-rigorous arguments. Martin-Lof [24| studies a generalized Reed-Frost epidemic 
model, where the number of individuals that a given infective person infects has an essentially 
arbitrary distribution. The binomial case is equivalent to studying the random graph G(n,p ) 
on n vertices with edge probability p. The author considers the regime where TZq — 1 = 
an~ 1 / 3 and the initial number of infectives is fen 1//3 , for constant a, b. A limit distribution 
is derived for the final size of the epidemic, observing bimodality for certain values of a 
and b (corresponding to ‘small’ and ‘lar ge’ e pidemics). Further analytical properties of the 
limit distribution are derived in 14|. In [lB], a standard SIR epidemic for populations with 
homogeneous mixing is studied, with vaccinations during the epidemic; a diffusion limit is 
derived for the final size of a near-critical epidemic. 

In the present paper, we address near-critical phenomena in the context of an epidemic 
spreading in a population of a large size n, where the underlying graph (network) is a random 
(multi)graph with given vertex degrees. In other words, we specify the number of contacts 
for each individual, and consider a graph chosen uniformly at random from among all graphs 
with the specified sequence of contact numbers. This random graph model allows for greater 
inhomogeneity, with a rather arbitrary distribution of the number of contacts for different 
persons. We study the regime just above the critical threshold for the emergence of a large 
epidemic, where the basic reproductive ratio is 1 + u(n)n~ 1 ^ , with uj(n) growing large as 
the population size n grows. (For example, when the population size is about 1 million, we 
could consider 7Z 0 of order about 1.01.) 

From the theory of branching processes, at the start of an epidemic, each infective indi¬ 
vidual leads to a large outbreak with probability of the order 7Z 0 — 1. Roughly, our results 
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confirm the following, intuitively clear from the above observation, picture. If the size n of 
the population is very large, with the initial total infectious degree Ayo (he. total number 
of potential infectious contacts at the beginning of the epidemic or total number of acquain¬ 
tances of initially infectious individuals) much larger than (IZq — 1) _1 , then a large epidemic 
will occur with high probability. If the initial total infectious degree is much smaller than 
( TZq — l) -1 , then the outbreak will be contained with high probability. In the intermediate 
case where A I0 and (1Z 0 — l) -1 are of the same order of magnitude, a large epidemic can 
occur with positive probability, of the order exp(— cAi i0 (7^o — 1 )), for some positive constant 
c. So, if the population size is about a million, and TIq about 1.01, then Ai )0 much larger 
than 100 will result in a large epidemic with high probability. On the other hand, if A^o is 
less than 10, say, than the outbreak will be contained with high probability. 

Furthermore, we determine the likely size of a large epidemic. Here, there are three possible 
regimes, depending on the size of the initial total infectious degree relative to n(7l 0 — l) 2 . 
Broadly speaking, if Ayo is much larger than n(lZo — l) 2 , then the total number of people 
infected will be proportional to (nA^o) 1 / 2 - On the other hand, if A^o is much smaller than 
n(JZo — l) 2 then, in the event that there is a large epidemic, the total number of people 
infected will be proportional to n(JZo — 1). The intermediate case where Ai,o and n(lZo — l) 2 
are of the same order ‘connects’ the two extremal cases. 

Note that, if Ai j0 is of the same or larger order of magnitude than n(Tl 0 — l) 2 (the first 
and third case in the paragraph above), then Xi 0 (TZ 0 — 1) is very large, so a large epidemic 
does occur with high probability. This follows since, by our assumption, n(7^ 0 — l) 3 = co(n) 3 
is large for large n. 

The above results are proven under fairly mild regularity assumptions on the shape of the 
degree distribution. We allow a non-negligible proportion of the population to be initally 
recovered, i.e. immune to the disease. (This also allows for the possibility that a part, not 
necessarily random, of the population is vaccinated before the outbreak, since the vaccinated 
individuals can be regarded as recovered.) We require that the third moment of the suscep¬ 
tible degree be bounded; in particular, that implies that the maximum susceptible degree in 
a population of size n is of the order no larger than n 1//3 . So in particular, in a population of 
size 1 million, the super-spreaders (i.e. individuals with largest numbers of contacts) should 
not be able to infect more than around 100 individuals. 

To demonstrate this behaviour for a particular example, we used stochastic simulations 
that make use of special Monte Carlo techniques that allow us to consider multiple initial 
conditions within the same realisation of the process. The algorithm is described in Ap¬ 
pendix [A] Figure |T| shows our results for the relationship between the epidemic final size Z 
and the initial force of infection Xj 0 for 20 realisations of the process, with each realisation 
involving multiple different initial conditions, for population sizes n = 10 5 , n = 10 6 and 
n = 10'. The model rate parameters are p = 1 and ,5 = 1; and the network has Poisson de¬ 
gree distribution with mean A = 2.02 meaning that IZq = 1.01. (The network was generated 
as an Erdos-Renyi random graph with edge probability A /n.) These plots show the emer¬ 
gence of the three epidemic sizes that our results predict as n increases, i.e. ‘small’ epidemics 
of size 0(1), ‘large’ epidemics of size proportional to (nAyo) 1 ^ 2 , and ‘large’ epidemics of size 
comparable to n(Jlo — 1). 

Epidemics on graphs with given degrees have been considered in a number of recent stud¬ 
ies, both within the mathematical biology and probability communities. A set of ordinary 
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case where the fifth moment of the degree of a random vertex is uniformly bounded and 
in the case of bounded vertex degrees respectively. See also jd] for results in the case of 
bounded vertex degrees and general infection time distributions.) 

However, we appear to be the first to study the ‘barely supercritical’ SIR epidemic on a 
random graph with given degrees. As a corollary, we determine the probability and size of 
a large near-critical epidemic on a sparse binomial (Erdos-Renyi) random graph, also to our 
knowledge the first such results in the literature. 

Our approach also enables us to prove the conjecture of Janson and Luczak in [20[, es¬ 
tablishing their Theorem 2.4 concerning the size of the largest component in the barely 
supercritical random graph with given vertex degrees under weakened assumptions. 

We proceed in the spirit of jH] and (ioj, evolving the epidemic process simultaneously 
with constructing the random multigraph. The main technical difficulties involve delicate 
concentration of measure estimates for quantities of interest, such as the current total degrees 
of susceptible, recovered and infective vertices. Also, our proofs involve couplings of the 
evolution of the total infective degree with suitable Brownian motions. 

The remainder of the paper is organised as follows. In Section [21 we define our notation 
and state our main results (Theorems 12.31 and 12.41) . Section [3] is devoted to the proof of 
Theorem 12.31 to this end, we define a time-changed version of the epidemic and use the 
modified process to prove concentration of measure estimates for various quantities of inter¬ 
est. In Section [4] we prove Theorem 12.41 In Appendix [Bj we state and prove a new result 
concerning the size of the giant component in the supercritical random (multi)graph with a 
given degree sequence. 

2. Model, notation, assumptions and results 

Let n E N and let (dj)\? =1 = (d\ n ^)™ =1 be a given sequence of non-negative integers. Let 
G = G(n, (di)l Li) be a simple graph (no loops or multiple edges) with n vertices, chosen 
uniformly at random subject to vertex i having degree d, for i — 1,... , n, tacitly assuming 
there is any such graph at all (J2™ =1 d 7 must be even, at least). For each k E Z + , let n k 
denote the total number of vertices with degree k. 

Given the graph G, the epidemic evolves as a continuous-time Markov chain. Each vertex 
is either susceptible, infective or recovered. Every infective vertex recovers at rate p n ^ 0 
and also infects each susceptible neighbour at rate f3 n > 0. 

Let ns, ni, and Ur denote the initial numbers of susceptible, infective and recovered 
vertices, respectively. Further, let ns,£, n \,k and n^ k respectively, be the number of these 
vertices with degree k ^ 0. Thus, ns + n\ + n^ = n and ns = J2T=o n s,k, n i = 
nR = J2T=o n ^ k , and n k = ,k + n i,k + n R,k- We assume that this information is given 
with the degree sequence. Note that all these quantities (as well as many of the quantities 
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introduced below) depend on n. To lighten the notation, we usually do not indicate the n 
dependence explicitly. 

Remark 2.1. We allow > 0, i.e., that some vertices are “recovered” (i.e., immune) 
already when we start. It is often natural to take tir = 0, but one application of Ur > 0 is 
to study the effect of vaccination; this was done in a related situation in 0lj] and we leave 
the corresponding corollaries of the results below to the reader. Note that initially recovered 
vertices are not themselves affected by the epidemic, but they influence the structure of the 
graph and thus the course of the epidemic, so we cannot just ignore them. 


The basic reproductive ratio TZq is commonly used in the context of epidemic models, and 
defines the average number of new cases created by a case of infection. In analogy with the 
(2.23)], for the SIR epidemic on a random graph with a given degree 

fin Y,T=o( k ~ l ) kn S,k 
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limiting case in 
sequence, we define 


n, = n := 


+ fin 


Er = 0 kn k 


( 2 . 1 ) 


Pn i Wi A^k =0 ' 

Here, the probability that an infective half-edge infects another half-edge before recovering 
—- and the average increase in the number of infective half-edges due to such an 

SfcLn { k ~ l ) kn s,k 


IS 


Pn~\~0‘, 

infection event is 


and these are approximately independent of one another, 


0 ^ n k 

and approximately independent for different half-edges. 

Note that the basic reproductive ratio TZq determines the approximate geometric growth 
rate of the disease during the early stages of the epidemic. The value TZ 0 = 1 is therefore 
the threshold for the epidemic to take off in the population, in the sense that, if 71q > 1, 
then a macroscopic fraction of the susceptibles can be infected 0: 0; H 0 0. Here we 
will consider the case where TZq = 1+ o;(n)n' 1 / 3 , with u(n) tending to infinity slowly (slower 
than n 1 / 3 ) as n —* oo. 

It turns out that, rather than working with the quantity TZq — 1, it is easier to work with 
a quantity a n defined by 


Q„ := -(1 + Pn/ftn) 51 knt/ns - Y" kik - 1 )n s j,/n s . 


( 2 , 2 ) 


k =0 


k =0 


Note that 

a n = ( n 0 - i) ^ + ^Er=o^ (2.3) 

fin n S 

Our assumptions below imply that 1 ^ £h+Pji — 0(1) and that kllk is bounded and 
bounded away from 0 as n —> oo, see Remark 12.81 Hence (7 Z 0 — l)^^ 1 is bounded and 
bounded away from 0, and so a n is equivalent to 7^. 0 — 1 as a measure of distance from 
criticality; see further (12.221) . In particular, we could rephrase our assumptions and results 
in terms of TZq — 1 instead of a n , but it seems that the mathematics works out more cleanly 
using a n . Also, we can expect an initial growth if and only if a n > 0. 

We consider asymptotics as n —> oo, and all unspecified limits below are as n — > oo. 
Throughout the paper we use the notation o p in a standard way, as in [l8[. That is, for 
a sequence of random variables (y( n ))J° and real numbers (a„)“, ‘ Y^ = o p (a n )’ means 

Y^/a n —^ 0. Similarly, = O p (l) means that, for every e > 0, there exists K e such 
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that P(|Y'(G| > AT) < e for all n. Given a sequence of events (A ri ) ( j >0 , A n is said to hold 
w.h.p. (with high probability) if P(A n ) —> 1. 


Our assumptions are as follows. (See also the remarks below.) Let D S n denote the degree 
of a randomly chosen susceptible vertex, so P(Dg in — k) — ng^/ng for each k ^ 0. 

(Dl) Ds, n converges in distribution to a probability distribution (pk)^L 0 with a finite and 
positive mean A := be. 

—— —> Pk, k ^ 0. (2.4) 

ns 

(D2) The third power Dg n is uniformly integrable as n —> oo. That is, given e > 0, there 
exists M > 0 such that, for all n, 


E 

k>M 


k 3 n s ,k 

n s 


< £. 


(2.5) 


(D3) The second moment of the degree of a randomly chosen vertex is uniformly bounded, 
be- Sfcto k 2n k = 0(n). 

(D4) As n —> oo, 

a n —* 0 and ngaj) —* oo. (2.6) 


(D5) The total degree YlkL o °f initially infective vertices satishes 

OO 

= o(w), (2.7) 

k =o 

and the limit 

^ OO 

z/ := lim -- fcuifc G [0, oo] (2.8) 

exists (but may be 0 or oo). Furthermore, either v — 0 or 

OO 

di := max{fc : rq^ ^ 1} = kn^k). (2.9) 

fc =0 

(D6) We have Po + Pi + P 2 < 1- 
(D7) lim inf^oo n s /n > 0. 


We will repeatedly use the fact that (D2) implies that there exists a constant Co such that, 
for all n, 


T k 3 n Sl i, = n s E Di v < c 0 n. 


( 2 . 10 ) 


k =0 


Remark 2.2. Assumption (Dl) says Ds, n —> D g, where Dg has distribution (pk)kL 0 - Given 
|(D1)[ assumption (D2) is equivalent to E D$ n 
uniform integrability of Ds, n and A>g n , so EDs, 


—)• EDg < 


00. 


(D2) and (D7) further imply that 

ds,* '■= max{/c : ns,k ^ 1} = o(ng 3 ). 


Furthermore, (D2)| implies 
—* ETg and ET| n —> EDg. Assumptions 
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( 2 . 11 ) 















Using the notation in Remark 12.21 A = ET>s. Furthermore, let 


A 2 := ^k(k - 1 )p k = E D S (D S - 1), 

k=0 

oo 

A 3 := Y, k(k - 1 )(k - 2 )p„ = E D$(D S - 1 )(D S - 2). 


( 2 . 12 ) 

(2.13) 


k =0 


Then, the uniform integrability (D2) of D g n implies A 2 , A 3 < 00 and furthermore 


A 2 = lim Eh s , n (hs, n - 1) = lim V] k(k - 1) 

n. — n .—Vno * 


k =0 


ns ,k 
ns ’ 


A 3 = lim ED s ,n(Ds,n ~ 1)(-Ds,n - 2 ) = lim V] k(k -l)(k - 2) 

n.— n — ^ 


k =0 


^S,fc 

ns 


(2.14) 


(2.15) 


Also, A, A 2 , A 3 > 0 by (D6) 


Let G* = G*(n, (dj)i) be the random multigraph with given degree sequence (d*)" defined 
by the configuration model we take a set of di half-edges for each vertex i and combine 
half-edges into edges by a uniformly random matching (see e.g. [8j). Conditioned on the 
multigraph being simple, we obtain G = G(n, (di)™), the uniformly distributed random 
graph with degree sequence (di)™. The configuration model has been used in the study of 
epidemics in a number of earlier works, see, for example, Q3E3E30- As in many other 
papers, including [2lj], we prove our results for the SIR epidemic on G*, and, by conditioning 
on G* being simple, we then deduce that these results also hold for the SIR epidemic on 
G. The results below thus hold for both the random multigraph G* and the random simple 
graph G. 

This argument relics on the probability that G* is simple being bounded away from zero 
as n —y 00 ; by the main theorem of [l7[ (see also lj|) this occurs provided condition (D3) 


holds. Most of the results below are of the “w.h.p.” type (or can be expressed in this form); 
then this transfer to the simple graph case is routine and will not be commented on further. 
The exception is Theorem 12.^fni) where we obtain a limiting probability strictly between 0 


and 1, and we therefore need a more complicated argument, see Section SI we also use an 
extra assumption in this case. 

We now state our main result, that, under the conditions above, the epidemic is either 
very small, or of a size at least approximatively proportional to na n (and thus to u(TZq — 1)). 
As just said, the theorem holds for both the multigraph G* and the simple graph G. 


Theorem 2.3. Suppose that (D1)-(D7) hold , 


Let Z be the total number of susceptible vertices that ever get infected. 

(i) If u — 0, then there exists a sequence e n —> 0 such that, for each n, w.h.p. one of the 
following holds. 

(a) Z/nsOL n < e n (the epidemic is small and ends prematurely). 

(b) \Z/nsa n — 2A/A 3 | < e n (the epidemic is large and its size is well concentrated). 

(ii) If 0 < v < 00 , then Z/nsa n A(1 + y/l + 2z/A 3 )/A 3 . 


















(iii) U v = oo ; then 


Z 


(™s ET=o kn i,k) 


1/2 


V2X 

y/Xf 


(2.16) 


Moreover, in cases (i) 'b) 


k ^ 0 susceptible vertices that ever get infected, 

Zk kp k 


and (iii), the following holds. Let Z k be the number of degree 
Then 


E 

k=0 


Z 


X 


0. 


(2.17) 


Thus, (I2.17P says that, except in the case (ij[a) the total variation distance between the 


degree distribution (Z k /Z) of the vertices that get infected and the size-biased distribution 
(kpk/ A) converges to 0 in probability. 

of Theorem 12.31 savs that, for a range of initial values of the number of 


Note that case 


infective half-edges (viz. when u — 0), if the epidemic takes off at all, then it has approx¬ 
imately the size (2A/A 3 )nsa n . Hence, in this range, the size of the epidemic does (to the 
first order) not depend on the initial number of infective half-edges (only the probability of 
a large outbreak does), so this can be seen as the “natural” size of an epidemic. This also 
means that in this range, most of the outbreak can be traced back to a single initial infective 
half-edge. 

However, when the initial number of infective half-edges number gets larger, the many 
small outbreaks coming from the different initially infective half-edges will add up to a 
substantial outbreak. So there is a threshold where this bulk of combined small outbreaks is 
of about the same size as the “natural” size of a large outbreak. The value v is, in the limit 
as n —y oo, the ratio of the initial number divided by this threshold, so it shows, roughly, 
whether the combined small outbreaks give a large contribution to the outbreak or not. Our 
theorem then shows that, if the initial number of infective half-edges is larger (to be precise, 
v > 0), then they force a larger outbreak, with a size that is proportional to the square root 
of the initial number of infective half-edges in the range v = oo. (For 0 < v < oo, there is a 
smooth transition between the two extremal cases.) 

The following result gives conditions for the occurrence of a large epidemic in Theo¬ 
rem [27JI)] I11 anticipation of later notation, let Xjo := YlT=o kn Ik be the total degree of 
initially infective vertices (i.e. the total number of initially infective half-edges). 


Theorem 2.4. Suppose that the assumptions of Theorem \2.3\ are satisfied with v — 0 

(i) If oi n X i >0 —> 0 


then Z = o p (a n 2 ) = o p (nsot n ), and thus case (7 'a) in Theorem 12.31 


occurs w.h.p. 

If a n Xi i0 —> 00 then case (i)|[b) in Theorem \2.S\ occurs w.h.p. 


(iii) Suppose that a n Xi 0 is bounded above and below. In the simple graph case, assume also 
that Yfkst 1 k 2 ni,k = o(n) and Yfk^a.- 1 ^ 2?7 -r ,k = o(n). Then both cases (i)|[a) and (i) 'b) 


in Theorem \2.3I occur with probabilities bounded away from 0 and 1. Furthermore, if 
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di * = o(A" Ii o), then the probability that case 


in Theorem \2.c\ occurs is 


exp 


A 2 + A + Y,T=o kn R,k/ns 

X2X3 


cx n -X-' 


1,0 


+ 0 ( 1 ). 


(2.18) 


Moreover, in the case the epidemic is small, Z = O p (a n 2 ). 


































Note that YlT=a ^ n R,fc/ n s i n (12.181) is bounded because of (D3) and (D7)[ and that (12.18[) 


holds in cases (i) and (ii) too. A more complicated formula extending (12.18jl holds also in 


the case when the condition dj,* = o(Ad,o) fails, see (j4.63[) in Remark 14.41 

Remark 2.5. The quantity v ^ 0 controls the initial number of infective contacts. If v > 0, 
so a large epidemic occurs by Theorem 12.31 then 


(\ n A [ Q' 1 


OO V-^OO 7 

V" , 3\2^k=o kn W 

/ , kni t k = ( n s a n )——^- 


fc=o 


ns«; 


—>■ oo, 


by (12.6p and (I2.8p : hence the condition in Theorem I2.-J|fii) holds automatically when v > 0. 

Remark 2.6. The condition (12. 7p that the total degree of initally infective vertices is o(n) 
is, by (D3) and the Cauchy-Schwarz inequality, equivalent to rij = o(n), at least if we ignore 
isolated infective vertices. Note that the opposite case, when ni/n has a strictly positive 
limit, is treated in (2ll. Theorems 2.6 and 2.7] (under otherwise similar assumptions). 

Remark 2.7. The assumption (12. 9 p (which is required only when v > 0) says that no single 
infective vertex has a significant fraction of the total infective degree. 


Remark 2.8. Assuming (Dl) and (D2) the assumption a n —> 0 in (D4) is equivalent to 
TZq —» 1, as said above. To see this, note that (Dl) and (D2) imply (see Remark [2.2p 


F krik/ns W kriS'k/ns = E D Sl n — E / '. - A > 0 


(2.19) 


fc =0 


k =0 


If a n — > 0, then (12.19p and (12.31) imply that 77 0 —> 1. 

Conversely, still assuming (Dl) and (D2), if 1Z 0 —)■ 1, then it follows easily from (12.11) that 

Pn/Pn = 0(1), (2.20) 

and also that 


^2 kn k = C)(^(k - l)/cn s , fc ) = 0(n s ). (2.21) 

k =0 k =0 

Hence, (12.31) implies that a n —> 0. 

To be precise, (12.31) and (12.ip yield by (12. 14p and IZo —> 1, 

«n = 7e "~ 1 ^ fc= ° (fc ~ 1)/CnS ’ fc = (1 + o(l))A 2 (77 0 - 1). (2.22) 

77 0 ii s 

Note that by combining the two parts of the argument, we have shown that our assump¬ 
tions (Dl), (D2) and (D4) imply (I2.20p and the complementary bounds (12.191) and (12 .2 ip . 
(This can also easily be seen using (12.2h .) 


Remark 2.9. We saw in Remark 12751 that (Dl), (D2) and (D4) imply (I2.2ip . Since n — n 0 ^ 
^kLokjik, it follows that n — no = O(ns). Hence, assumption (D7) is needed only to the 
exclude the rather trivial case that almost all of the population consist of isolated infective 
vertices, which cannot spread the epidemic. Note also that (D7) implies that it does not 
matter whether we use ?ig or n in estimates such as (12.lip . 
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2.1. G(n,p) and G(n,m). The results above apply to the graphs G(n,p) and G(n,m) by 
conditioning on the sequence of vertex degrees (which are now random), since given the 
vertex degrees, both G(n,p ) and G(n,m) are uniformly distributed over all (simple) graphs 
with these vertex degrees. Moreover, if n —» oo and p ~ X/n, or m ~ n\/2, for some A > 0, 
then the degree distribution is asymptotically Poisson Po(A). For G(n,p), this leads to the 
following result. 


Corollary 2.10. Suppose that (5 n > 0 and p n ^ 0 for each n G N. Let X ^ 1, and assume 
that Lh±£il \ as n oo. L e t n n — > 0, and consider the SIR epidemic on the random graph 

Pn 

G(n , A ( 1 ^ ??n ' > ) with infection rate (3 n and recovery rate p n . Suppose that there are n\ = o(n) 
initially infective vertices chosen at random, and all the other vertices are susceptible. Let 

+ <?„-(! +<?„)-■ (2.23) 

XRn n 


Then y n —* 0. Assume that ny 3 —> oo, and that p = lim exists 


(i) If p — 0, then there exists a sequence e n —> 0 such that for each n, w.h.p. one of the 
following holds. 

(a) Z/{n^ n ) < e n . 

(b) |Z/(ny n ) - 2| < e n . 


Moreover, the probability that (a) holds is 

exp(—(1 + A _1 )7 n n/) + o(l). 

In particular, 


(2.24) 


holds w.h.p. if ^nHi —> 0 and (b) holds w.h.p. if 7 n n\ —> 00 . 

(ii) If 0 < p < 00, then Z /ny n —1 + y'l + 2 p. 

(iii) If p = 00, then 


Z 


(nsni) 1 / 2 

The same holds for G(n,m ) with m = nA(l + rj n )/ 2 


V2. 


Proof. As said above, we condition on the vertex degrees. We have ns,k/ n s ~Pk '■ = 
P(Po(A) = k ) for every k ; for convenience, we use the Skorohod coupling theorem (22I . Theo¬ 
rem 4.30] so we may assume that this holds a.s. for each k; thus f!2.4p holds a.s. Similarly we 

hold a.s. Furthermore, 

oi n is now random, and it is easy to see from (j2,2j) that 


may assume that Y2k^ n k/n converges a.s., and then (D2) and (D3) 


ns 

n 


fin Pn 

Pn 


(1 + Pn) X + ((1 + Tjn) A) ^1 




(1 + Vn) A 2 7n + Op(n 1 ^“) — (A 2 + Op(l))7n 


(2.25) 


Repeating the Skorohod trick, we may thus assume also that a n /j n —> A 2 . Similarly we may 
assume A I 0 = ^ n iy = (l + Oinf 1 ^ 2 ))Xn\, and then (12.81) holds with v = p/ A 3 ; it is also 
easy to see that (12.91) may be assumed. Then all the conditions (Dl) - (D7) hold a.s., and the 
result follows as a consequence of Theorems 12.31 and 12.41 noting that D s ~ Po(A), and thus 
A 2 = A 2 and A 3 = A 3 . □ 
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3. Proof of Theorem 12.31 

3.1. Simplifying assumptions. We assume for convenience that rij = o(n). In fact, we 
may assume that rq.o = 0 by deleting all initially infective vertices of degree 0, since these 
are irrelevant; then rq = o(n ) as a consequence of (12.71) . Note that this will not affect 7 Z 0 , 
a n , v or the other constants and assumptions above. 

Similarly, we assume that initially there are no recovered vertices, that is n r = 0. It is easy 
to modify the proofs below to handle the case riR ^ 1. Alternatively, we may observe that 
our results in the case tir = 0 imply the corresponding results for general Ur by the following 
argument. (See [l6| for similar arguments in a related situation.) We replace each initially 
recovered vertex of degree k by k separate susceptible vertices of degree 1, so there are a total 
of Wr i0 := Y^k =o additional “fake” susceptible vertices of degree 1; this will not change 
the course of the epidemic (in the multigraph case) except that some of these fake susceptible 
vertices will be infected. (Note that they never can infect anyone else.) The alteration will 
not affect 77-o, although a n and the asymptotic distribution (p fc ) will be modified. Note that 


Xr i0 — O(ns) by (D3) and (D7) by considering suitable subsequences we may thus assume 
that XR t o/ns -» r for some r e [0, oo). It is easy see that the modified degree distribution 
satisfies all the assumptions above and that, if we use a prime to indicate quantities after the 
replacement, then n' s = ns + A"r, 0 rs-/ (l + r)n s , ~ at n /(l + r), n' s a' n = n s a n , u' = (1 + r)u, 
Pi = (pi + r )/(l + r )> Pk — Pk/(1 + r) for k ± 1, A' = (A T r)/(l + r), A' 2 = A 2 /(l + r), 
^3 = "W ( 1 '+ r ) - 

If case (i|[a) in Theorem 12.31 occurs for the modified process, it occurs for the original 


process too, since Z ^ Z r , and there is nothing more to prove. 

In the other cases, we have Z' —» 00 w.h.p. We note that of the n' si = ris.\ + A" R 0 
susceptible vertices of degree 1, A R 0 are fake. Conditioned on the number Z\ of susceptible 
vertices of degree 1 that get infected, the number Z' — Z = Z[ — Z\ of fake susceptible 
vertices that get infected has a hypergeometric distribution, and, using e.g. Chebyshev’s 
inequality, it follows that w.h.p. (leaving the simple modification when pi — r — 0 to the 
reader) 

Ar,o 


z' - z = z[-z 1 = 


n s ,i 


X] 


R,0 


-Z[ + o(Z') = 


Pi + r 


Z'+o(Z'). 


(3.1) 


By (12.171) and the relations above, this yields w.h.p. 


Z' -Z = 


Pi 


-Z[ + o(Z') = 


Pi 


Pi + r A 


7 Z' + o(Z') = —Z> + o(Z'). 
' A + r 


(3.2) 


Consequently, w.h.p. ZjZ' = A/(A + r) + o(l). 

It is then easy to check that Theorem 12.31 and Theorem 12.41 for the original process both 
follow from these results in the case with no initially recovered vertices. 

We make these simplifying assumptions rq = o(n) and Ur = 0 throughout this section 
(and the following one), in addition to (D1)-(D7). In particular, n\ + tir = o(n), and thus 
(D7) is strengthened to 

ris/n —> 1. (3.3) 

We may also assume a n > 0, by ignoring some small n if necessary. Finally, recall that in 
the proofs we first consider the random multigraph G*. 
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3.2. Time-changed epidemic on a random multigraph. We first study the epidemic 
on the configuration model multi grap h G*, revealing its edges (i.e. pairing off the half-edges) 
while the epidemic spreads, as in ||2l| (see other variants in [ll; 0; 12; 3]). We call a half-edge 
susceptible, infective or recovered according to the type of vertex it is attached to. Unpaired 
half-edges are said to be free. Initially, each vertex i has di half-edges and all of them are 
free. 

Each free infective half-edge chooses a free half-edge at rate fd n > 0, uniformly at random 
from among all the other free half-edges. Together the pair form an edge, and are removed 
from the set of free half-edges. If the chosen free half-edge belongs to a susceptible vertex 
then that vertex becomes infective. Infective vertices recover at rate p n ^ 0. 

We stop the process when no infective free half-edges remain, which is the time when the 
epidemic stops spreading. Some infective vertices may remain but they trivially recover at 
i.i.d. exponential times. Some free susceptible and recovered half-edges may also remain. 
These could be paired uniformly to reveal the remaining edges in G* , if desired. However, 
this step is irrelevant for the course of the epidemic. 

In order to prove our results, we perform a time change in the process: when in a state 
with x\ ^ 1 free infective half-edges, and a total of x free half-edges of any type, we multiply 
all transition rates by (x — l)/f3 n x\ (this multiple is at least l/(2/5 n ), since x\ ^ 1 implies that 
x ^ 2). Then each free susceptible half-edge gets infected at rate 1, each infective vertex 
recovers at rate p n {x — 1)/f3 n x\ , and each free infective half-edge pairs off at rate (x — l)/x\. 

In the time changed process, let S t , I t and R t denote the numbers of susceptible, infective 
and recovered vertices, respectively, at time t ^ 0. Let S t (k ) be the number of susceptible 
vertices of degree k ^ 0 at time t. Then S t = YlkLo ^t(k) is decreasing and R t is increasing 
in t. Moreover, S 0 (k ) = ns,fc, Iq = and R 0 = Ur = 0. 

Also, we let Ag ,t, Ai )t and Ar^ be the numbers of free susceptible, infective and recov¬ 
ered half-edges, respectively, at time t. Then Ag )t = YlT=okSt(k) is decreasing, Ag )0 = 
YlT=o kns,k, Arc = kn\,k and Ar i0 = 0 (by our simplifying assumptions in Section l37L]h 

We denote the duration of the time-changed epidemic by 


T* := inf{t ^ 0 : A u = 0}. (3.4) 

At time T*, we simply stop, as said above. (The last infection may have occurred somewhat 
earlier, since the last free infective half-edge may have recovered or paired of with an infective 
or recovered half-edge. It follows e.g. from (13.101) below that the last actual infection w.h.p. 
did not happen much earlier, but this is irrelevant for our results, and we use (j3.4[) as the 
definition.) 


3.3. Concentration of measure. We will show that S t (k), Ag jt , A T t and Ar^ are uniformly 
close to certain deterministic functions. Let 


OO 


hs,n(t) 

:= ^ kn S,k e k \ 

(3.5) 


k =0 


,n(f) 

oo 

:= ^e _t (l - e -4 ) ^/cn fc , 

Pn ,_ n 

(3.6) 
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oo 


(3.7) 


hl'nit) := e 2 ‘ 2j kn k - hs, n [t) - h Rn (t). 
k =0 

Theorem 3.1. Let a n be any numbers with a n ^ a n = o(l) such that 

OO 

Y k 2 ni )k = o(n 2 a„). (3.8) 

k =0 

Then, for any fixed to < oo, 

OO 

Y SU P \St{k) - n s , k e~ kt \ = o p (na 2 n ), (3.9) 

fc=0 ts ? a " t oA T* 

sup \X s , t ~ hs, n (t)\ = o p (nal), (3.10) 

t^Olnto AT* 

sup \X R} t - /iR, n (t)| = O p (no£), (3.11) 

t^OntoAT* 

sup |W,t — hi tU (t)\ = o p (nal). (3.12) 

t^a-nto AT* 


The above result establishes concentration on time intervals of length 0(a n ). In Sec¬ 
tion 13.41 we use it to show that, for a suitable choice of a n , the duration of the epidemic 
satisfies T* = 0(a n ) w.li.p. It follows that the theorem then holds also with t 0 = oo, see 
Remark 13.61 

The remainder of this subsection contains the proof of Theorem 13.11 We first need two 
lemmas concerning the evolution of the number of susceptible vertices and the total number 
of free half-edges. 

In the time-changed epidemic, each free susceptible half-edge gets infected at rate 1, until 
T*. We further modify the process so that free susceptible half-edges continue to be infected 
at rate 1 even when there are no more free infective half-edges. Let S t (k) be the number of 
susceptible individuals of degree k in the modified process. Then (StAT*{k) : k G Z + ,t ^ 0) 
has the same distribution as ( S t /\T*(k ) : k G Z + ,f ^ 0), and so, to prove (13.91) and (13.1 0D . it 
suffices to prove that 

OO 

Y sup \St(k) - 7i Stk e~ kt \ = o p (nal), (3.13) 

k =o 

and 

sup \X s , t - h Sj n(t)\ = o p (nal), (3.14) 

t^Otnt 0 

where Xs.t = o k ^t(k)- For each t, let 

OO 

w t := Y k 2 (S t (k ) - n s , k e~ kt ). (3.15) 

k =0 


Lemma 3.2. Fix to < oo and assume a n ^ a n = o(l). Then Esup ts ; arito \W t \ = o{na n ), and 
hence 


E sup 

ty<5nio AT* 


Y k2 ( s t(k) - ns, k e 


—kt\ 


k =0 


= o(na n ). 
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Proof of Lemma HOI We enumerate the initially susceptible vertices as i — 1, 2,..., ns and 
denote by the degree of initially susceptible vertex i. Let Li be the time at which 
initially susceptible vertex i becomes infective (in the modified process). Then each L t has 
exponential distribution with rate dg,*, and the L, (i — 1,2,... , 7ig) are all independent of 
one another. It follows that, for each fixed t, the random variables F ht := dg,/!^^ —e~ tds ’' i ) 
each have mean zero and are all independent. Note that W t = ^i.t- 

Each \F iit \ is bounded by dg*, where, as in (12.111) . dg,* = maxjdg^. Hence, by Bernstein’s 
inequality for sums of bounded independent centred random variables, see e.g. [25|, Theorem 
2.7] or [ 9 , (2.10)], for each a ^ 0, 


ns 


P(|W t | >a) =P 


i —1 


> a ^ 2 exp 


Now, for any t ^ a n t 0 , using (I2.10p . 


ns 


ns 


2E"iEi? t + 2ad|,,/3 


n s 


i= 1 


2 E E = 2 E d l< Var (u,») < 2 E - e ~“ s ') 

2=1 2=1 
ns 

< 2 4,.f E dg i ^ ®ndg,* ^ ^ k Tlg^k ^ 2,CotflOi n 'ndg^ 


i= 1 


Furthermore, a n n 1//3 ^ cvn. 1 / 3 —> 00 and so by (12.111) . 

ds,* = o(n 1/3 ) = o((na n ) 1/2 ). 


(3.16) 


(3.17) 


(3.18) 


Thus, for n sufficiently large, ds,* ^ ( na n Y' 2 , and then for any u ^ 2coto and a = 
u(nd n ) 1/2 d s> *, by (I3.17j) . 


u 2 \ 

2 c 0 t 0 + 2udg i */3(nd ri ) 1 / 2 / 

f u 2 \ 

yi exp-^ exp (—m/2) . 

V 2c 0 t 0 + w; 1 ; 

Hence, by (I3.16p . for n sufficiently large and for each each t ^ toa n and u ^ 2c 0 to, 

P(|W t | > M(nd n ) 1/,2 dg i *) ^ 2 exp (—m/2) . (3.19) 

Note also that (nd n ) 1 / 2 dg i * = o(na n ) by (13.181) . Let u> n be an integer valued function 
such that cu n — y 00 and (nd n ) 1 / 2 ds,*a; n = o(na n ). We divide the interval [0,to<5 n ] into 
u> n subintervals [ 77 , t;+i], where 77 = lt 0 a n /u} n for l — 0 ,..., u n — 1 . 

Since S t (k) and e~ kt are both decreasing in t, each of the sums Y2T=o k 2 S t {k ) and YYk=o k 2 ns,ke 
is also decreasing in t. Thus, for any 0 ^ l < co n , 


exp 


2 E"iE^ t + 2a<il i ,/3 




exp 


sup | W t | ^ 

ri*Zt^Ti + 1 


^k 2 (S n {k)-n S:k e kn+1 ) 

k =0 


E fc W + ,m 

k=0 


00 

^ \W Tl \ + \W Tl+1 \ + 2j2k 2 n s , k (e- k ^-e- kT ^) 

k =0 
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n s ,fce feri ) 


kt 

























<iw^i+iw^«i+2x; k 3 7 i s , k ( ri + 1 - n ), 


k =0 


and so, since YJT=o k 3fl s,k < c 0 n and 77+1 — n — t 0 a n /u n = o(a n ), noting W 0 = 0 , 

sup \W t \ — max sup \W t \ < 2 max | W n | + o(na n ). (3.20) 

t^a n to l<Wn Ti^t^Ti- 1_1 l^l^UJn 

Now, for n sufficiently large and u ^ 2c 0 t 0 , by (I3.19|) . 

pf max \W T .\ > u(na n ) 1 / 2 ds,*') ^ 2ca n exp(— u/2). (3.21) 

For sufficiently large n, 2uj n ^ e cot °, and then (13.2ip holds trivially for u < 2coto too. Hence, 
for large n, 


E max | W Tl \ = (na n ) 1 ^ 2 ds t „ 


1 <1<UJ. 


P( max | W Tl \ > u(na n ) 1 ^dsA du 


^ (na n )^ 2 ds,* / 2 ca n e U ^ 2 du 

Jo 

= 4 (na n ) 1/2 ds,*u>n = o(na n ), 


(3.22) 

and hence also Esup t<5rito \W t \ — o(na n ) by (13.201) . □ 

We now prove a concentration of measure result for the total number X t of free half-edges. 
Lemma 3.3. For every fixed t 0 > 0, and a n ^ a n = o(l), 


sup 

t^anto A T* 


-W - e 2t ^2 kn k = o p (n<3 2 ). 

k =0 


(3.23) 


Proof. When in a state with x\ ^ 1 free infective half-edges, and thus x ^ 2 free half-edges 
in total, each free infective half-edge pairs off at rate ( x — l)/xi, and so the number of free 
half-edges decreases by 2 at rate x — 1. We modify the process so that pairs of free half-edges 
still disappear at rate x—1 when there are no more free infective half-edges (as long as 
x ^ 2). Let X t be the number of free half-edges at time t in the modified process. Then it 
suffices to prove that 


sup 

t^Oint 0 


X t - e 2t 2_^ KU k 

k =0 




°p\ na n) 


Now, X t — 1 is a linear death chain starting from Yl'k=o^ nk ~ 1) and taking jumps from 
state j to j — 2 at rate j. By [20|, Lemma 6.1], with d = 2, 7 = 1, and x = J2T=o k' n k ~ 


E sup 


(X t - 1) - e _2t kn k - l) ^ 16(e 2 “" to - 1 ) kn k + 32. 


k =0 


k =0 


But ^2^L 0 krik = O(n) by (D3) a n t 0 = o(l) and na n ^ na n —* 00 by (D4) so the right- 
hand side is 0(nb n ), and so sup t ^ nto \X t — e~ 2t J2T=o k n k\ = O v (\/rmJJ) = o p (nai 2 ), using 


(D4) 


□ 
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Proof of Theorem \3. 1[ We start by proving (13. 10[) . and, as remarked after the statement of 
Theorem 13.11 it is enough to prove that 


sup Q^kS^k) - hs, n (t ) 

t^atnto k=0 


o P (na 2 n ). 


(3.24) 


Now, for each k, ( St(k )) is a linear death chain starting from ns, k and decreasing by 1 at 
rate kx when in state x, and so 

S t (k) = n s ,k ~ k f S u {k)du + M t (k), (3.25) 

Jo 

where M(k ) = ( M t fk )) is a zero-mean martingale. It follows that 


X] kS t fk) - h s , n (t) = k(S t (k ) - n S}k e~ kt ) 

k =0 k =0 

oo 

= -J2 k l (£«(*) - n s,k e ~ ku )du + Mt 

k =0 

W u du + M t , (3.26) 

where M t = 'Yh k kM t {k) defines a zero-mean martingale M = (M t ). 

Since S t (k) and S t (j ) with k ^ j never jump simultaneously, we have [M(k),M(j)\ = 0, 
where [•, •] is the quadratic covariation, see e.g. jHI, Theorem 26.6]. Since also each jump of 
St(k) is by —1, the quadratic variation [M\ t := [ M,M] t is 



OO OO 

[M], = £ k 2 [M(k)] t = £ * 2 ]T(A S u (k)f 

k =0 k =0 u^t 

oo oo 

= - 5> 2 Z)( A S«(*0) = E- St(k)) 

k =0 u^t k =0 


= k2 ^e~ kt ~ S t fk ) + ns, k ( 1 - e~ kt )) 

k=0 


< \ J2 k2 (St(k)-n s , k e- kt ) 

k=0 


= \W t \+0(tn), 


OO 

+1 ^ k 3n s,k 

k=0 


again using (12.101) . By Lemma [3721 Esup t ^ a7it |Wt| = o(na n ), so 

E[M] 5nt0 = 0(na n ), (3.27) 

and so, using the Burkholder-Davis-Gundy inequalities j22|, Theorem 26.12], sup^ dnf \M t \ = 
O p (y/na n ). Hence by (I3.26jl and Lemma [3721 uniformly in t ^ <3 n t 0 , 


5E fc (v t (fc) 

k=0 


ns, k e 


—kt\ 


< 


W v du 


+ O p (y/‘ 


na r 
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^ a n t 0 sup \W t \ + O p (v^ 

t^CXnt 0 
2 \ 


Op ( TUX n y, 


using again 


(D4) This establishes (13.1011 . 


Next we prove (13.911 . By [20, Lemma 6.1] with d = 1 and 7 — k and x = ns t k, for k ^ d n 1 , 

^ —fct 12 ^ A/^kanto _ ^ Ab™ . (3.28) 


E sup \S t (k) - n s ,ke | 4(e ' Qn 0 - l)n s ,fc ^ 4 ka n t 0 e °n s ,fc, 

t^a n to 


where the last step uses the simple inequality e x — 1 ^ xe x for x ^ 0. For k > <5 n , we 
use the trivial bound | S t (k) — ns,kG~ kt \ ^ 7is,fc- Using Jensen’s inequality and then the 


Cauchy-Schwarz inequality, as well as (12.10)1 and (D4) 

OO 

E^ sup | S t (k) - n s ,ke~ kt 


k =0 


t^OLnto 


< Y (4ka n t 0 e to n s ,k) 1/2 




Y ns,k 

1/2 


< 2 (a n t 0 e to ) 1/2 Y k 2 Y. k 3 n s ,k 


+ cu 


Y k3,ns ’ k 


K k =1 


fc=l 


k =1 


= 0 (\/nd^) + O(nci^) = 0 ( 7107 ), 
which yields (13.13)1 and thus (13.9(1 . 

We now prove (13.1111 . The number of free recovered half-edges changes when either an 
infective vertex recovers or a free infective half-edge pairs with a free recovered half-edge. In 
the time-changed process, when in a state with x\ free infective half-edges and x free half¬ 
edges, infective vertices recover at rate p n (x — l)/j3 n xi. Also, each free recovered half-edge 
is chosen to be paired at rate 1 , and thus the number of recovered free half-edges decreases 
by 1 at rate x R . Hence, for any t ^ 0, 


rtAT* 


X- 


R.tAT* 


= X 


R,0 


Pn 


rtAT* 


X R , s ds + — / (X s — l)ds + , 

Pn Jo 


(3.29) 


where Mr = (M Rjt ) is a zero-mean martingale. 

On the other hand, differentiating (13. 6 p reveals that 

Pn _2 


I'hiJt) = -hR, n (t) + ^e 2t Y krik - 

Pn 


(3.30) 


k =0 


Hence, subtracting the integral of that expression from (13.29)1 . and recalling that X R0 = 0, 


X Rl tAT* ~ k R , n (t A T*) 


rtAT* 

^ / | A^r,sat* — h,R, n (s AT*)\ds 

Jo 

•tAT* 00 

—2s 


+ 


rtAl ^ 

- X s — 1 — e~ 2s Y kn k ds + |M R)tAT 

n k =0 


Then Gronwall’s inequality yields 


sup \X R)t - h Rtn (t) | < e ant °a n t 0 

t^a n toAT* 


Pn 


sup 


Pn \ t^&ntoAT* 


x t - e 2t Y kn * 


+ 1 


k =0 
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+ e &nt ° sup |Af R>t |. (3.31) 

t^a n toAT* 

Since p n //3 n is bounded and a n —> 0, the first term on the right-hand side is o p (na 2 ), by 
(I3.23p . It remains to show that the same is true of the martingale term. 

Note that X^ t jumps by —1 when a free recovered half-edge is paired with a free infective 
half-edge, and it jumps by +k when an infective vertex with k free half-edges recovers. Also, 
each recovered half-edge or vertex was either initially infective or was initially susceptible 
and then became infected prior to recovery. Hence 


E[M R k toAT * - E[ (AM r , s ) 2 ] 

s^a n toAT* 

oo oo 

< E ^2 k(ni,k + ns,k - Sa n t 0 AT* (k)) + E ^ k 2 (m,* + n s ,k ~ S &nto AT* (k)) 

k =0 k =0 

oo oo 

< 2 ^ k 2 ni )k + 2 E ^2 k 2 (n s , k ~ S &Tlt 0 at* (k)) 

k =0 k =0 

oo oo 

= 2^ k 2 m, k + 2E^fc 2 (ns, fc (l - e - k ^toAT^ +nske -k(a n t 0 AT*) _ s antoAT *(k )) 

k=0 k =0 


oo oo oo 

< 2 ^ k 2 rii )k + 2 a n t 0 ^2 k 3 n s ,k + 2 E k 2 (n s ,fee _fe(Q " toA ' r * ) - S antoAT * (k)) 

k=0 k=0 k=0 

= o(n 2 a*) + 0(na n ) + o(na n ) = o(n 2 d„) 


by (13.81) . (12.10]) . Lemma [3721 and na\ ^ no? n —* cx), see (D4) Then, by the Burkholder- 
Davis-Gundy inequalities, sup t ^ fi t AT » \Mn >t \ = °p( na n )> and (13.111) follows by (I3.31D . 


Finally, (13.121) follows from (13.101) . (13.111) . (13 .231) . and the fact that A If = X t — X s 


3.4. Duration of the time-changed epidemic. We stated Theorem 13.11 using a rather 
arbitrary a n , but from now on we fix it as follows. We distinguish between the cases v < oo 
and v = oo, and introduce some further notation: 

If 0 ^ v < oo, define 


(3.32) 

/(f) := v + t- ft 2 , (3.33) 

x := (1 + \Jl + 2 u\ 3 )/X 3 . (3.34) 

If v — oo, define instead 

tin := (y^kn^k/n) , (3.35) 

k=0 

f{t) ■= 1 - if* 2 , (3-36) 

x := v/2/V (3.37) 
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Note that in both cases, x is the unique positive root of /, and that /(f) > 0 on (0, x) and 
/(f) < 0 on (x, oo); we have /(0) = 0 if v = 0 but /( 0) > 0 if v > 0. Note further that in 
the case v = oo, a n /a n —> oo by (12.81) : in particular, a n ^ ct n except possibly for some small 
n that we will ignore. Moreover, a n — > 0 by (D4) (z/ < oo) or (D5) (v = oo). 

Next, if v < oo, then, by (12.81) . 


E kni ’ k = 0 { n s a l ) = 0(na 2 n ), 


k =0 


and if v = oo, then by (13.351) . 


Hence, in both cases, 


E kni t 


k = na n . 


(3.38) 


(3.39) 


k=0 


E = O (nd 2 ). (3.40) 

k=0 

Furthermore, if z/ = 0 then (12. 8p yields kni t k = o(na 2 ) and thus 

oo oo 2 

E fc2y h,fc ^ (E kn i,k) = o(n 2 a4), (3.41) 

Zc=0 fc=0 

and if 0 < u ^ oo then (j2.9[) and (I3.40P imply 

oo oo oo 2 

E ,k < dl,* E = 0 (E = °( n 2 < 5 n) • ( 3 - 42 ) 

/c=0 k= 0 /c=0 

Hence, (13.81) holds in all cases. 

We have verified that our choice of a n satisfies the conditions of Theorem 13.11 so Theo¬ 
rem [XU applies. We use this to show a more explicit limit result for X\ t . 

Lemma 3.4. For any fixed to, 

Xl a n t 


sup 

/a n ) 


nai 


~f(t ) 


0. 


(3.43) 


Proof. The idea is to combine Theorem 13. 11 with a Taylor expansion of h^ n (t) around zero. 
The first three derivatives of h^ n (t) are 


k =o 

OO 


~~ kt _ (fa 

Pn 


k =0 
oo 


/i^(f) = -2e 2t E kn k + E fe2n s,fce kt + TT e *(l-2e *) E 

fc=0 fc=0 Pn 

oo oo 

M'nW = 4e~ 2< E hlk - E 

+ ^e _4 (l - 8e _t ) E 
Pn 


k =0 
oo 


''^s.fce - — e ~ (1 4e *) E knk i 


k =0 
oo 


^'l" n (f) = ~ 8e 2t E kn k + E ,fce ** ' ^ * 


(3.44) 

(3.45) 

(3.46) 


Hence, using 


Zc=0 fc=0 

, (I3.40P and a n —> 0, 

OO OO 

p 


k =0 


= -2 e knk + E fc2n s,fe - E krik = ns " n - E = noin+ ( 3 - 47 ) 

A- 


k=0 


k =0 


k=0 


k=0 
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and similarly, using also (I2.15j) . 

OO OO OO 

= 4 ^ kn k ~ fc 3 n s,fc + 3^ ^ kn k 


k =0 


/c=0 

OO 


' 10 k =0 

OO OO 


3(1 + pn/fin) ^ kn k + fcn S,fc _ 

/c=0 k= 0 /c=0 /c=0 

OO OO OO 

3(1 + p n /p n ) ^ kn k - k<[k ~ 1 )( k ~ 2 + 3 ) n S,fc + ^ kni, 

k= 0 k= 0 /c=0 

OO 

-3 n s a n - y 'k(k- 1 )(k - 2)n s , fc + O(nd^) = -n\ 3 + o(n). 


k=0 


Also, for t ^ 0, 


K'nWI < (8 + 7 pn/Pn) ^ ^ ^s.aT kt = 0{n) + ^ k 4 n s , k e kt . 

k =0 /c=0 /c=0 

Hence, for any M ^ 1, 

/ ctn^O 

|/^" n (t)| df =/ 0(nd n ) + a n f 0 ^ A/ 4 ?r S)fc + ^ k 3 n s , k { 1 - e~ koint °) 

k^M k>M 


< 0(M 4 na n ) + ^ fc 3 n s , fc = o(n) + ^ /c 3 n s>fc . 


k>M 


k>M 


Letting M —* oo slowly (so that M 4 a n = o(l)), and using (D2) we obtain 

CCX-nt 0 


lim — 

n—>•oo Tl 


\ti" n (t)\ dt = o. 


f*OLnt 


(d n t — u) 2 h'" n {u) du, 


Now, by a Taylor expansion, for t ^ 0, 
hi,n{di n t) = V(0) + hJ n (0)<S n f + §h" )7l (0)(d: n t) 2 + 
and hence, using h In (0) = J2 k kni,k, (13.471) . (13.48ft and (13.49ft . uniformly in t ^ f 0 , 

OO 

^ 1 ,71(^71 *0 ^ ^ “I - TlOL n (Xrit T1A3 “I - o(? 7 /(a n ). 


/c=0 


If v < oo, then d n = <a n , and ( 13.51H yields by ( 12.8ft and ( 13.3ft 

L4M = „ + 1 _ 1(^3 + 0(1) = m +0(1). 

na n 

If v — oo, then (13.511) yields similarly by (j3.35[) and a n = o(a n ), 


hl^n (®n^) 


— 1 — |f 2 A 3 + o(l) — /(f) + o(l). 


nai 


Consequently, in both cases /q iTl (a n f)/na 2 = /(f) + o(l), uniformly for 0 ^ f ^ f 0 , 
result follows by combining this and (13 . 1 2j) from Theorem 13 . 1 1 
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(3.48) 


(3.49) 

(3.50) 

(3.51) 

(3.52) 

(3.53) 

and the 

□ 
























We can now find (asymptotically) the duration T*, except that when v = 0, we cannot 
yet say whether the epidemic is very small or rather large. 

Lemma 3.5. (i) If 0 < u ^ oo, then T*/a n —^ x. 

(ii) Ifv = 0, then for every e > 0, w.h.p., either 

(a) 0 ^ T*/a n < e, or 

(b) \T*/a n - x| < s. 

In particular, in both cases, w.h.p. T* ^ 2xd n . 

Proof. Take t 0 = 2x. Then /(to) < 0, so (13.431) implies that P(T*/d n ^ to) —> 0, i.e., 
T* < t 0 a n w.h.p. Consequently, we may w.h.p. take t = T*/a n in (13.43[) and conclude 
\Xi T */na 2 — f(T*/a n ) | — ^ 0. Since Xi T * = 0 by definition, this says f(T*/a n ) — 'h- 0. 

Consider /(t) for t G [0, oo). If v > 0, then /(t) has a unique zero at x, and is bounded 
away from 0 outside every neighbourhood of x; hence T* ja n —x follows. If v — 0, 
/(t) = 0 both for t = 0 and t = x, and flu]) follows. □ 

Remark 3.6. Lemma 13.51 shows that taking to := 2x in Theorem 13.11 we have w.h.p. 
d n t 0 AT* = T* , and thus, for a n as above, Theorem 13.11 holds also with the suprema taken 
over all t ^ T*. 

3.5. Final size. 


Proof of Theorem \2.3[ Recall that Z k := ns, k — Sr*(&) is the number of susceptibles of degree 
k that ever become infected, and Z = 'ff /k Z k . For each k e Z + , 

S T *(k ) - n s ,ke~ kT * 


Z k . T* 

— - kp k — 

tiol t (y, 1 


< 


not T,. (y 


+ 


rift, 


Since 11 — e 27 — y| ^ y 2 for all y ^ 0, and using (Dl) (D2) and (13.3j) . 

Z kan t)ns,i 


E 


(i 


na. 


- kp k t 


< t ^ - n s ,fc/n| + a n t 2 ^ - -* 


fc =0 fc =0 fc =0 

uniformly in t ^ to := 2x. Since T*/a n At t 0 w.h.p. by Lemma [3.51 it follows that 

n s , fc (l - e~ kT *) 


E 

k=0 


^—kT*\ rp* 


kpk — 


= Opl 1 )' 


Further, by 


noc n a r 

in Theorem 13.11 and Lemma 13.51 see Remark 13.61 

S T *{k) - n s , k e~ kT * m 

= Op(a n ) = o p (l). 


E 

k =0 


na r 


It follows by (I3.54p that 


and, in particular, 


E 

k=0 


. T* 

— - kp k — 

na n o^r?. 


= Op(l) 


Z X T * 

_ 

E (^-kptP) 

na n 


\ na n a n J 

k =o v n n/ 


= o P (l). 
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(3.54) 


(3.55) 


(3.56) 











































The estimate (13.561) and Lemma [3751 yield the conclusions (i) -(iii) by (j3.3[) and the definitions 
of a n and x in (j3.32j) — (13.37)1 . For (i), when v = 0, we first obtain that if e n = e > 0 is fixed 


but arbitrary, then the conclusion holds w.h.p., and it is easy to see that this implies the 
same for some sequence e n —* 0. (Note also that if u — 0, then x = 2/A 3 .) 

Furthermore, combining (j3.55j) and (I3.56p . we obtain 


OO 

£ 

k =0 


Zk 

ncL 


kp k Z 


A na r 


= o P (l)- 


(3.57) 


We have shown that, except in the case (ij[a) there exists e > 0 such that w.h.p. Z/na n ^ e; 
then (12.17ft follows from (I3.57p . □ 


4. Proof of Theorem 12.41 

We continue to use the simplifying assumptions in Section 13.11 We consider the epidemic 
in the original time scale and construct it from independent exponential random variables. 
At time 7 = 0, we allocate each of the rij initially infective vertices an Exp(p n ) recovery 
time. We also give each free infective half-edge at time 0 an Exp(/3 n ) pairing time. If the 
pairing time for a free infective half-edge is less than the recovery time of its parent vertex, 
then we colour that free half-edge red. Otherwise, we colour it black. We now wait until the 
first recovery or pairing time. At a recovery time, we change the status of the corresponding 
vertex to recovered. At a pairing time of a red free half-edge, we choose another free half¬ 
edge uniformly at random. If the chosen free half-edge belongs to a susceptible vertex then 
that vertex becomes infective, is given an Exp(p n ) recovery time, and its remaining free 
half-edges are given independent Exp(/9 n ) pairing times. Then, as above, we colour red any 
free half-edge with pairing time less than recovery time, and colour black all other free half¬ 
edges at the chosen vertex. The process continues in this fashion until no red free half-edges 
remain. Note that we do nothing at the pairing time of a black free half-edge, since it is no 
longer infective, and so black free half-edges behave like recovered free half-edges. Also, a 
red free half-edge will definitely initiate a pairing event at some point (provided it has not 
been chosen by another red free half-edge first). However, ignoring the colourings we obtain 
the same process as before. 

Let Z t be the number of red free half-edges at time t ^ 0. Note that Z t changes only at 
pairing events, but not at recovery times. (The point of the colouring is to anticipate the 
recoveries, which then can be ignored.) Further, let Z m := Zx m , where T m is the time of 
the m:th pairing event (and T 0 := 0), and let ( m := A Z m := Z m — Z m _j. (Note that our 
processes are all right continuous, so Z m is the number of red free half-edges immediately 
after the m-th pairing has occurred and we have coloured any new infective free half-edges.) 
Thus the process stops at T mt , where m* := min{m ^ 0 : Z m = 0}. (This is not exactly 
the same stopping condition as used earlier, but the difference does not matter; there may 
still be some infective half-edges, but they are black and will recover before infecting any 
more vertex.) Let T m = .F(T m ) be the corresponding discrete-time filtration generated by 
the coloured SIR process up to time T m . 

We keep the same notation as before for the free half-edge counts (so the total number of 
free infective half-edges, whether red or black, is X\ t ^ Z t , for example), and write again 
Sf(k) for the number of susceptible vertices with k free half-edges at time t ^ 0. Furthermore, 
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define 


Pn 

Pn T Pn 


(4.1) 


the probability that a given free infective half-edge is coloured red. Note that c ^ 7r n ^ 1 
for some c > 0 by (12.201) . 

We begin by showing that a substantial fraction of the initially infective half-edges are 
red. Recall that A I;0 = YPk =o ^ n i,k is the total degree of the initially infective vertices and 
that di * is the maximum degree among these vertices. 


Lemma 4.1. Suppose that Ai >0 —> oo. 

(i) Ifdi ,* = o(Ai >0 ), then Z 0 = n n Xi fi (l + o p (l)). 

(ii) More generally, for any d\^, we have 

lim lim sup p ( z 0 ^ dXio^ = 0. (4.2) 

| 5 — ^0 n^HX) ' ' 


Proof. We enumerate all initially infective vertices as i — 1,..., and let be the degree 
of vertex i, so that Xi ;0 = X);=i We also let Z Q l be the number of red free half-edges 
at vertex i, so Z 0 = Zo,i, where the Z 0}i are independent, with E Z 0ji = di ; j7r n and 

Zo,i ^ dp*. It follows that E Z 0 = = ^n^i,o and 


n\ ni 

Var Z 0 = ^ Var Z (ll ^ ^ d 2 * ^ di,*Ai,o- 

i=l 2 — 1 


(4.3) 


(i) If di,* = o(Ai,o), then (14.31) yields Var Z 0 = o(Aj 2 0 ) = o((EZ 0 ) 2 ), and thus Chebyshev’s 
inequality yields Z 0 = EZ 0 (1 + o p (l)). 


We assume first that dj * ^ d 1 / 2 .A I0 . Then (14.3j) and Chebyshev’s inequality yield 


P(Z 0 ^ SX I>0 ) ^ 


Var Z n 


(E Z 0 — dX Ij0 ) 




dl ’* Xl ’° < 4vr -2 

;|7T n Al,o) 2 " n 


d 1 / 2 



(4.4) 


Assume now instead that dj.* ^ d 1 / 2 Ai >0 . Fix one initially infective vertex of degree dj 
let Zo ; * be the number of red free half-edges at that vertex, and let R* be its recovery time. 
Then Z 0 ^ Z 0) *, and so Z 0 ^ dXi j0 implies that Z 0 ^ d^di,*. We have 

P(^o,* ^ d 1 / 2 ^,*) = P(Z 0> * < S^d^R* ^ 4dV 2 //3 n ) +P(Z 0i * ^ > 48 1 / 2 / p n ) 

^ P(R* sC 4 5 1/2 /p n ) + P(Z 0 ,* < d 1/2 di,* | R, > A8 1/2 /p n ). (4.5) 


Now, 

P (R* ^ ^ 2 /p n ) = 1 - e -« 1/a P»/^ ^ 48 1 / 2 p n /p n . (4.6) 

Also, conditional on R+ = r, Z 0i * has a binomial distribution with parameters dj* and 
1 — e~^ nT . It follows that, conditional on R* > 4 8 1 ^ 2 /P n , Z 0i * stochastically dominates a 
Bin(di ) *, 1 — e _45l/ ") random variable. For d small enough, 1 — e _4(5l/2 ^ 2d 1 / 2 , and so, by 
Chebyshev’s inequality, 


P 


Z 0 ,* V d^d 


I,* 


R* > 


4d 4 / 2 \ / 

) V P(Bin(d Ii „2d 1 / 2 ) ^ d 1 / 2 d Ii * 
P?1 ' ' 




2d 1//2 d 


i,* 






dA, 


1,0 


(4.7) 
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Combining (I4.4H . (14.5H . (14.6p and (14.7p . we see that if 5 is small, then in both cases 


P (Z 0 < 6X Ii0 ) < 4 


Pn 4“ fir 
fin 


/ sx 1>0 


and (14.2ft follows, since Ai )0 —> oo and p n /fi n = 0(1) by (I2.20p . □ 

Lemma 4.2. Let (W m )“ =0 fre a process adapted to a filtration (L m )“ =0 , with W 0 = 0, and 
let t ^ oo 6e a stopping time. Suppose that the positive numbers v,w > 0 are sac/i that 

E[AVF m+ i | Am] ^ u a.s. on {m < r}, (4.8) 

E[(Aiy m+1 ) 2 ] < W (4.9) 

for every m ^ 0. Then, for any b > 0, 


P ( inf lF m < -6 ) ^ 

, O^msgr / OV 


(4.10) 


Proof. Consider the Doob decomposition 

W m = M m + A m , (4-11) 

where A m := E[A Wi | Jy_i] is predictable and M m := W m — A m is a martingale with 
respect to (J 7 m ). By the assumption (14.8p . A m ^ mv a.s. when m ^ r. Furthermore, 

E[AM 2 ] = E[(AW m - E[AW to | A,^]) 2 ] = E[(AlD m ) 2 ] - E(E[AW m | F m _ x }) 2 < w. 

Thus, by Doob’s inequality, for any A ^ 1, 

p( inf W m ^-b) < pf inf M m < -b - Nv) < 

\N^m^(2N)AT J \N^m^2N J 

Summing over all powers of 2, we obtain 


E[Mf 


27VJ < 2Au> 


(6 +An) 2 (6 +An) 2 ' 

(4.12) 


OO oo 

p( inf W m < -ft) < Vpf inf W m ^ -&W V 

VlsSm^r / \2 fe <m<2 fc + 1 At / 

fc =0 fc =0 


2 fe+1 w 
fib + 2 fc r) 2 




E 


2 k+1 w ^ 2 fc+1 rr A(b/v)w 4 fiv/b)w 8 w 

6 2 2 2fc u 2 ^ 6 2 u 2 bv 

2 k ^b/v 2 k ^b/v 

Proof of Theorem [A iPIl Let 5 > 0 be a small positive number chosen later. Define the 
discrete stopping time m** by 


rn** := min jm ^ 0 : Z m = 0 or ^ k 2 (ns,k — S Tjn {k)) > (5na r 

fc=o 

Note that for m < m**, the total number of free half-edges at time T m is 

OO OO oo 

^~Tm ^ E ^ ^ ^ 8TlOL n ^ ^ ^ STlOl. n o(72Q( n ), 

/ c =0 fc =0 / c =0 

since = o(nsc^) = o(no; n ) by (12.8p and (12.6)) . Similarly, for m < ra**, 

oo 

= Zr m ^ Z 0 + A:(n S| fc - S Tm (k)) ^ E 0 + 5na„ ^ <5na n + o(na„), 

fc=o 


(4.13) 


(4,14) 


(4.15) 
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since Zq is bounded above by YlT=o^ n i> fc = °( n s a n) — o(na n ). 

At a pairing m + 1 ^ m**, a red free half-edge pairs with a free susceptible half-edge, or 
with another red free half-edge, or with a black half-edge. In the first case, if the susceptible 
half-edge belongs to a vertex of degree k, we get on the average 7 r n (k — 1) new red free half¬ 
edges; in the second case we instead lose one red free half-edge, in addition to the pairing red 
free half-edge that we always lose. The probability of pairing with a susceptible half-edge 
belonging to a vertex of degree k is kSr m (k)/X Tm and the probability of pairing with another 
red free half-edge is Z m /XT m - Hence, for m + 1 ^ m**, using fl4.13p - fl4.15lh f!4 .1 1) and the 
definition (12.1 p of Ho, 


K[AZ m+ i 


-p 1 > , _ HV=o(k ~ l)kS Tm (k) _ Zrn _ 

Er=o kn k Er=o kn k - (5 + o(l))na n 

^ 1 J2kLo( k - l ) kn s,k - Sna n (5 + o(l))na n 

' U Er=o kn k EZokn k -(5 + o(l))na, 

S? —1 + H 0 — 0(8a n ). 


Since (Ho — l)^” 1 is bounded away from 0 by (12.31) and Remark 12781 this shows that as long 
as 6 is chosen small enough there exists some ci > 0 such that if n is large and m < m**, 
then 

E[AZ m +i | T m \ A ci« n . (4.16) 

Furthermore, noting that the number of red free half-edges may change by at most k at a 
jump if a red free half-edge pairs with a free susceptible half-edge at a vertex of degree k, 
the expected square of any jump satisfies, for m < m**, 


E[(AZ m+ i)" | J- m ] ^ 4 + 


Er= 0 


S,k 


Efclo kn k ~ (S + o(l))na r 


< C 2 , 


(4.17) 


for some C 2 > 0, uniformly in all large n, by assumption (D2) 
Let W m Z m /\m* t 


-Zq. It follows from (I4.16p - fl4.17p that Lemma [4721 applies with r = m 


** 5 


v = cia n and w = C 2 - 

Let a = a n > 0 satisfy a n —* 00 and a n = o(a n EfcLo as n °°- Then Lemma 14.21 
with b = a n /a n yields 


( 


P I inf 


W rn A 


~^nf 


< 


8 c 2 

ClCin 


= 0 ( 1 ) 


(4.18) 


and thus W rntt > —a n /a n w.h.p. On the other hand, Lemma 14 . 1]l ii) implies that P (Z 0 ^ 
a n/ a n) 0. Consequently, = W mtt + Z 0 > 0 w.h.p. By (14.13p . this means that w.h.p. 
E^lo k2 ( n s,k ~ ^T mtt (k)) > 8na n , and thus that, for some time t before the epidemic dies 
out, EZok 2 (n s ,k ~*S t (k )) > 5na n . 

The latter statement does not depend on the time-scale, so it holds for the time-changed 
epidemic in Section 13.21 too. Thus, using the notation there, by the monotonicity of the 
number of susceptibles, w.h.p. 


^2k 2 (n s ,k - S T *(k )) > Sna n . (4.19) 

k =0 
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On the other hand, Lemma [3.21 (with a n = a n ) and (12.10p yield, for any e > 0, 


snp 

0<t^ea„AT* 


^k 2 (n Stk - S t (k )) 


k =0 


^ sup 

t^sa„AT 


I ^2k 2 (S t (k) - n s . 


k& 


—kt\ 


k =0 

< o p (na n ) + £c 0 na r 


“f" SCX 1 


k3ns ’‘ 


k =0 


(4.20) 

Consequently, if we first choose 5 > 0 so small that (I4.19p holds, and then e < <5/c 0 , then 
(14.19ji — (I4.20n imply that w.h.p. T* > ea n . It follows by Lemma fUH that T*/a n — 1 —>• x — 2/A3, 
and thus the result follows by (I3.56p . □ 


To study the cases 


anc 


we analyse the number of red free half-edges more carefully. 


Let the random variable Y(k ) be the number of new red free half-edges when a vertex of 
degree k is infected. Given the recovery time r of the vertex, Y(k) r^j Bin(fc — 1,1 — e /3nT ), 
and, since r ~ Exp(p n ), the probability 1 — e~^ nT has the Beta distribution B{1, p n /fi n ). 
Consequently, Y(k) has the beta-binomial distribution with parameters (k — 1,1, p n /fi n ). 
More generally, if D is a positive integer valued random variable, then Y(D) denotes a 
random variable that conditioned on D = k has the distribution Y{k). We have the following 
elementary result, recalling the notation (14.11) . 


Lemma 4.3. For any positive integer valued random variable D, 

E Y(D) = vr n (E D- 1), 

E Y(D') 2 — — ~ k){2D — 3) + TT n E (D — 1) 

1 + n n 

Proof. For each k ^ 1, we obtain by conditioning on the recovery time r, 
EY(k) = (k- 1) 1 . = (k - l)7r n , 

1 + Pn/fin 

EY (k ) 2 = ( fc ~ l)(2fc~2 + p n //3 n ) = (fc-l)(2/c-3)7T, 2 1 + (fc-l)7T rt 

(1 + Pn/fin) (2 + Pn/fin) 1 + ^n 

and (I4.2ip ~ fl4.22p follows by conditioning on D. 


(4.21) 

(4.22) 


(4.23) 

(4.24) 
□ 


Let A be a constant, and consider only m ^ M := |_Aa“ 2 J. At pairing event m for m ^ M, 
the number of free half-edges is at least kn k — 2Aa~ 2 ^ kn k ■ (1 — Ain^ l a~ 2 ) for some 

constant A\. Thus, the probability that a susceptible vertex with t half-edges is infected is 
at most, for A 2 2A 1 and large n, 


£71 


s,e 


Jfik kn k ■ (1 - Ain 1 a n 2 ) 


W—A ^ (! + A 2 n 1 a n 2 ) 


hi, 


si 


Efc kn h 


Let D + > 1 be a random variable with the distribution 


P (D + > 3) ■= 


j) := min ^(l + A 2 n l a n 2 ) ^^^ k , 1^ , j ^ 2, 


(4.25) 


(4.26) 


and let C + := Y(D + ) — 1. (Note that D + and C + depend on n, although we omit this from 
the notation.) Then := A Z m , conditioned on what has happened earlier, is stochastically 
dominated by C + - Hence, there exist independent copies (Cm)i° °f C + such that Cm ^ Cm 
for all m ^ M such that the epidemic has not yet stopped; furthermore, (Cm)i° are a lso 
independent of Z 0 . If the epidemic stops at m* < M (because Z mt = 0 so there are no more 
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pairing events), then we for convenience extend the definition of C m and Z m to all m ^ M 
by defining Cm := Cm f° r m > m *i an d still requiring ( m = A Z m . Consequently, Cm Cm f° r 
all m ^ M and thus the (possibly extended) sequence (. Z m is dominated by the random 
walk (Z+)$* with Z+ := Z 0 + i C + - 
Next, observe that (I4.26P implies 


ED + — 1 


X> (D+^j)^Yl 


3 =2 


3 =2 


'Yhk^j kn S,k 

Efc kn k 


Efc( fc - i)kn s>k 
Efc kn k 


(4.27) 


and also, since na\ —>• oo, 


fp (D+ > i) < f(l + = (1 + (4.28) 

8—9 „•— o Z^k k Z/k k 


3 =2 


J=2 


It thus follows that 


E D + - 1 = VP(D+ ^ j) = (1 + oK ))^. 

22k kn k 


(4.29) 


Hence, using ( 14.21}) . (14. ip . (12.ip and (I2.22p . 

Pn 


EC + = 


E(Z7+ - l) - l = (l + 0 (a w )) „ ^ 1)kUS ’ k - 1 


Pn A p n Pn, 4“ p n TCb k ^k 

— (l + o(a n ))77o — 1 = 77 q — 1 + o(a n ) = A 2 1 a n + o(a n ). 


(4.30) 


Note also that (14.2611 . using (12.4p . (12.61) . (12.711 . shows that as n —* oo, Z7 + —> l) s , where l)s 


has the size-biased distribution P(.Dg — k) — kp^/X. Moreover, it follows easily from (D2) 
and (I4.26P that D + is uniformly square integrable as n —» oo, and thus 


E(D + y ->■ E(D S ) = 


EO| 

Er.o*=Pt EB s 


(4.31) 


and similarly 


E D + —>EDs 


EZ7|_ 
E Ds 


(4.32) 


By (I4.30P and a n —> 0 we have EC + —> 0, and thus 7r n (E.D + — 1) —> 1. Hence, by (I4.32p (or 
directly from (12.1H ). 

_ Pn E-P g _ 

7111 ~ Pn + Pn ^ ED S (D S ~1) ~ Y 2 
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(4.33) 







































Since |C + | ^ D + , it follows that also C + is uniformly square integrable as n —> oo. Fur¬ 
thermore, E Y(D + ) = 1 + EC + = 1 + o(l) and thus, using (14.221) and (j4.31|f — (j4.33H . 

VarC + = Var (Y(D + )) = E {Y(D+) 2 ) - (l + o(l )) 2 
r 2 

■ E(D + — 1)(2D + — 3) + 




7Tn 


-+ 


1 + Tin 

A 2 


1 + 7T n 

^D s (D s — 1)(2D S — 3) 


A2 (A2 + A) 
A(2A3 + A 2 ) 
A2(A2 + A) 
2 AA 3 

A2(A2 + A) 


+ 


A9 


A2 + A 


ED S 
1 


+ 


E(D + - 1 )-l + o(l) 
A ED s (hs — 1 ) 


A2 + A 


E Dq 


- 1 


= : cr 


(4.34) 


Now consider a n (Z 


M 


Z 0 — ME( + ) = a n ^2\=i n J (C t ~ EC + )- The summands are 
i.i.d. with mean 0, and the uniform square integrability of () + implies that the Lindeberg 
condition holds; thus the central limit theorem [22|, Theorem 5.12] applies and yields, using 

(I4.34p . a n {Zf^ — Z 0 — ME( + ) -A 1 -)- iV(0, Ax 2 ) as n —* 00 . Moreover, normal convergence of 
the endpoint of a random walk implies Donsker-type convergence of the entire random walk 
to a Brownian motion, see 


228 . Theorem 14.20]; hence, 

~ 2 ec + 


a 


Z ta-~ “ ta - 


—t aB t , 


(4.35) 


where B t is a standard Brownian motion and we have defined Zf also for non-integer t by 
(We define Z t and Zff below in the same way.) Here the convergence is in 


z: ■.= 


+ 


distribution in the Skorohod space D[ 0,H], but we may by the Skorohod coupling theorem 
[22 1. Theorem 4.30] assume that the processes for different n are coupled such that a.s. (14.351) 
holds uniformly on [ 0 ,AL]. 

Moreover, a~ 1 Ed + —>■ Xf 1 by (14.301) . and thus (I4.35P implies 


Olr, 


Z + _ 2 
ta„~ 


Z 0 ) —y exB+ + An t. 


(4.36) 


Proof of Theorem 2. inn In this case, a n X i I)0 — > 0, and, since 0 ^ Z 0 ^ A^o, it follows 
from (I4.36P that a n Zf _ 2 —y aB t + AJ-4, where (as said above) we may assume that the 

tOLn 

convergence holds uniformly on [0, A] a.s. For any fixed 5 > 0, the right-hand side is a.s. 
negative for some t G [0, 5], and thus w.li.p. a n Z + _ 2 < 0 for some t G [0, h]. Since Z m ^ Z 

tOL n 

it follows that w.h.p. m* ^ 8a~ 2 , i.e. the epidemic stops with Z m = 0 after at most 8a 
infections. Hence, w.h.p. 

8a~ 2 = o(na„). (4.37) 

Since 8 is arbitrary, this moreover shows Z = o p (a ~ 2 ). □ 


m 

-2 

n 


Proof of Theorem [£+|(iii) in the multigraph case. We combine the upper bound Zf n above 
with a matching lower bound. Let x±,X 2 ,... be an i.i.d. sequence of random half-edges, 
constructed before we run the epidemic by drawing with replacement from the set of all 
half-edges. Then, at the m:th pairing event, when we are to pair an infective red half-edge 

7 ^ y m , we pair y m with x m ; otherwise we resample and pair y m 

-1 if Xm is initially 


y m , if x m still is free and x rn 


with a uniformly chosen free half-edge 7 ^ y rn . Furthermore, we let : = 
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infective, and if x m belongs to an initially susceptible vertex of degree k , we let Q m be a copy 
of Y(k) — 1 (independent of the history); if x m still is susceptible at the m:th pairing event 
(and thus free, so we pair with x m ), we may assume that := ( m , the number of new red 
free half-edges minus 1. Note that (Cm)m^i is an i.i.d. sequence of random variables with the 
distribution Y(D~) — 1, where D~ has the distribution obtained by taking A 2 = 0 in (j4.26j) : 
furthermore, (Cm)i° are independent of Z 0 . Let Z~ := Zq-\-YCh=i Cf- Note that (I4.27jl - fl4.34p 
hold for D~ and Q m too (with some simplifications), and thus, in analogy with (I4.36p . 

a " n — ^°) a ^t + A 2 (4.38) 


We next verify that we can take the same Brownian motion 


was initially 
was initially 


for some Brownian motion B t 
in g 3 SD and (jOBI) . 

Let (' m := C m — Cm- Thus Cm = 0 if x m is susceptible at time T m . If x r 
susceptible, with degree k, but has been infected, then Cm ^ Cm + 2 ^ k. If x , 
infected, then Cm — — 1 and thus Cm ^ Cm + 2 ^ 1 . 

Consider as above only m ^ M [AlaC 2 ], for some (large) constant A > 0. For m > m*, 

when the epidemic has stopped, we have defined Cm = Cm- Since Cm = E( D ± ) — 1 and D~ is 
stochastically dominated by D + , we may in this case assume that Cm. = Cm ^ Cm) an d thus 

C' ^ 0. 

For m ^ M, the number of initially susceptible half-edges that have been infected is at 
most, using f!2.1ip and (12. 6 p . mds,* = 0{a~ 2 ds ,*) = oia^n 1 ^) = o[n). Hence the number 
of free half-edges at T m is at least kns t k — mds,* = A n — o[n ) ^ c^n for C 3 := A/2 if n 
is large enough. It follows that the probability that a given initially susceptible vertex of 
degree k has been infected before T m is at most mk/^Ji), and the probability that one of 
its half-edges is chosen as x m is at most k/^n) for every m ^ M. Similarly, the probability 
that x m is initially infective is at most Xi^/^c^n). 

Hence it follows from the comments above, using ( 12 . 101 ) and the assumption a n A^o = 0(1) 
that (Cm)+ : = max (Cv 0) has expectation 


m 


111 




OO 


E(C 

Let Z' := Ef'(C) 


k 


- 1 , 


nt - . A . k + Ml = 0 (“) + O(^) = 0(4_) = 0 ( 0 .). (4.39) 

cxn c-m cxn \nJ V n / \ainJ 

k=0 


Then, by (14.391) . 

E Z' — o(Ma n ) = o(«C 1 )- 

Furthermore, for m ^ M, 


(4.40) 


Z m — 


E<) 

3 =1 


< z m + z' < z+ + z'. 


(4.41) 


Since (I4.36P and (14.381) hold (in distribution), the sequence (a n (Z _ 2 — Z 0 ), a n (Z^_ 2 — 
Z 0 )), n ^ 1, is tight in D[0, A] x D[0, A], Moreover, every subsequential limit in distribution 
must be of the form ( aB + A^ 1 !, aBf + A^f) for some Brownian motions B^ and B+. 
Since a n Z' 0 by (14.401) . it then follows from (I4.4ip that for any fixed t G [0, A], B^~ Bf 
a.s. Since B^~ and B^~ have the same distribution, this implies B^~ = B+ a.s. for every fixed 
t, and thus by continuity a.s. for all t G [0, A], 
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Since all subsequential limits thus are the same, this shows that (I4.36jl and (14.38P hold 
jointly (in distribution) with B£ = B t . Finally, by (14.4ip and (14.40p . this implies 

a n (Z ta -2 - Z 0 ) -^4 aB t + in D[ 0, A]. (4.42) 

Since the infimum is a continuous functional on £)[0, A], it follows that 

a n fe - Zo) inf {crB t + A . (4.43) 

For convenience, denote the left- and right-hand sides of (14.431) by Y n and Y. Since the 
random variable Y has a continuous distribution, (14.431) implies that, uniformly in i6R, 

P (Y n ^ x) = P(Y < x) + o(l). (4.44) 

The Brownian motion B t in (14.421) (I4.43j) is arbitrary, so we may and shall assume that B t 
is independent of everything else. 

We have defined m* := min{m ^ 0 : Z m = 0} and M := [Aa~ 2 ], anc f fi lus 

P(m* ^ M) = P(jnf Z ta -2 ^ 0j = P(Y n < -a n Z 0 ). (4.45) 

Recall that and above are independent of Zq. Hence, if we fix two real numbers a 

and 6, and condition on the event S^ ,b := {a ^ a n Z 0 < 6}, then for every subsequence such 
that lim infu-Kjo P(£“ ,fe ) > 0, the arguments above leading to (14.36p . (14.381) and (j4.42j) - (j4.44j) 
still hold. (We need lim inf^,*, P(£“ ,6 j > o in order to get a conditional version of (14.40p .) 
Consequently, P(K n ^ x \ = P(Y ^ x) + o(l), and thus, recalling that B t is independent 

oiZ 0 , 

P (Y n ^ x and £“’ b ) = P(Y < x) P(£“’ b ) + o(l) = P(Y ^ x and £% b ) + o(l). (4.46) 

On the other hand, (14.461) holds trivially if P(£“’ 6 ) —> 0. Every subsequence has a subsubse¬ 
quence such that either lim inf^^ P(£“ ,b ) > 0 or P(£“’ 6 ) —>■ 0, and in any case (14.461) holds 
along the subsubsequence; it follows that (14.461) holds for the full sequence. 

In particular, for any a and b, 

P (Y n ^ -a n Z 0 and £,f) ^ P(Y n < -a and £“’ b ) = P (Y ^ -a and £,f) + o(l) 

^ P(Y ^ -a„Z 0 + b - a and £“’ b ) + o(l). (4.47) 

By assumption, a n X i ;0 is bounded, say a n Xj t0 ^ C for some constant C; thus 0 ^ a n Z 0 Y 

a n Xi } o ^ C. Let 5 > 0 and divide the interval [0, C] into a finite number of subintervals 
[o,j,bj] with lengths bj — cij < 6. By summing (I4.47P for these intervals, we obtain 

P(W Y ~oi n Zoj ^ P(Y ^ —a. n ZQ + h) + o(l). (4.48) 

Since <5 > 0 is arbitrary, this implies 

P(Y n ^ ^ P(Y Y — «nYo) + o(l). (4.49) 

Similarly, we obtain F(Y n ^ —a n Z 0 ) ^ P(Y ^ — a n Z 0 — S') + o(l) and P(Y n ^ —a n Z 0 ) ^ 
P(Y ^ — a n Zo) T o(l). Consequently, 

P(Y n ^ —a n Zo) = F(Y ^ —a n Zo) + o(l). 
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(4.50) 




































In other words, using (14.45ft and recalling the meaning of Y from (I4.43p . 


P(m* ^ M) = P ^inf (aB t + A 2 1 f) ^ —a n Z^j + o(l). (4-51) 


If m* ^ M = |_v4a~ 2 J, then, similarly as (14.371) in the proof of case 

Z ^ m* ^ Aa~ 2 = o(na„) 

(a) in Theorem 12.g[FI)| 


(4.52) 


so we are m case 


If m* > M, consider again m ** dehned by (14.13[) (but taking minimum over m ^ M), for 

> 0, then (I4.19P holds 


a sufficiently small 5 > 0. Note that, as in the proof of 


n 


if Z„ 


and w.h.p. T* > ea n for some small e > 0, and thus w.li.p. (b) in Theorem I2.(|FI)] holds 
In other words, for some small e > 0, if m* M, then Z < ena n , and if m* > M and 
Z m „ > 0, then Z > ena n w.h.p. 

We next show that the probability that neither of these happens is small. We condition 

using Lemma 02] on Z ( M+m)Am „ - Z M , and 

(4.53) 


on Zm and argue as in the proof of case 
find 


n 


m* > M and = 0 | Z M ) < 


8c 2 


C 1 OL n ZM 


(4.54) 


(4.55) 


(4.56) 


Hence, using also (I4.42p . 

P(m* > M and Z mtt = 0) ^ P (a n Z M < |A 2 l A) + 0(1/A) 

Y P (aB A + X^A < IX^A) + o( 1) + 0(1/A) 

— 0(1/A) + o(l). 

Using (14.5ip and the comments above, it follows that, if e > 0 is small enough, then 
P (Z < ea n n) = P(m* ^ M) + 0(1/A) + o(l) 

= P ^inf (<jBi + A 2 ^ — Oi n Z(^j + 0(1/A) + o(l). 

This holds for every fixed A > 0, and we can then let A —» oo and conclude that 

P (Z < ea n n ) = P ^^hif (&B t + A 2 ^ —a n Z 0 ^J + o(l). 

It is well-known that — inf t ^ 0 (aB t + A-^f) has an exponential distribution with parameter 
2A^' 1 /a 2 , see e.g. [32|, Exercise 11.(3.12)]. Consequently, since Z 0 and (B t ) are independent, 

P (Z < ea n n ) = Eexp(—2A^V -2 a ri Zo) + o(l). (4-57) 

Since we assume that a n X i i0 is bounded above and below, Z 0 Y W,o and Lemma I4.l|lii) imply 
that the expectation in (14.571) stays away from 0 and 1 as n —* oo. Moreover, if di t * = o(Xi t o), 
then Lemma fOIi l and (14.571) yield, using (14.331) . 

P (Z < ea„n) = exp(—2A^V~ 2 a ri 7r ri A 7 i i o) + o(l) = exp(—2AA 2 V^c^A^o) + o(l), (4.58) 

which yields (12.18P by the definition of a 2 in (I4.34p . 

Finally, (14.521) and the argument above, in particular (14.54p . shows that 

P(the epidemic is small but Z > Aa~ 2 ) = 0(1/A) + o(l), (4.59) 

which implies the hnal claim. □ 
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Proof of Theorem \2. 4jm) in the simple graph case. As said in Section [2l this result for the 
random simple graph G does not follow immediately from the multigraph case (as the other 
results in this paper do). We use here instead the argument for the corresponding result in 
[ 2 IL Section 6], with minor modifications as follows. We continue to work with the random 
multigraph G*. Also, we now allow initially recovered vertices, since our trick in Section EU] 
to eliminate them does not work for the simple graph case. 

Fix a sequence e n —* 0 such that Theorem holds, and let C be the event that there 

are less than e)l 2 n§Oi n pairing events; note that if £ occurs, then Z < £n 2 risot n , while if £ 
does not occur, w.h.p. Z > £ n ns<y n by a simple argument (using e.g. Chebyshev’s inequality); 
hence £ says w.h.p. that the epidemic is small. 

Furthermore, let W be the number of loops and pairs of parallel edges in G *; thus G* 
is simple if and only if W = 0, and we are interested in the conditional probability P(£ | 

W — 0). By [Ti| (at least if we consider suitable subsequences), W -^4 W for some random 
variable W, with convergence of all moments. 

We write W = W\ + W 2 , where W 2 is the number_of loops and pairs of parallel edges 
that include either an initially infective vertex (as in 2l[), or a vertex with degree at least 
d := l/a n . Then, by the assumptions 


E W 2 = O ( ,k + fc2 ( n s,fc + n R,k ) 


v k =0 


k^d 


1, Er=o kw 

' o 

n n z 


= o(l) (4.60) 


and thus it suffices to consider W\. Note also that if we fix a vertex v that is not initially 
infected and has degree less than d, then the probability that the infection will reach v within 
less than £n 2 risoi n pairing events is O {d£n 2 na n /n) = o(l), so w.h.p. v is not infected before 
it is determined whether £ occurs or not. 

The rest of the proof is exactly as in (2l[, to which we refer for details. □ 


Remark 4.4. The formula (I2.18p for the asymptotic probability that the epidemic is small 
holds only under the assumption d\ = o(Aj,o); be., that among the initially infective vertices, 
no vertex has a significant fraction of all their half-edges. Even if this assumption does not 
hold, the asymptotic probability can be found from (I4.57H . since as in the proof of Lemma 14.11 

Zq = JA Z {) l where the Z 0yi are independent and, using the notation in Lemma 14.31 Z 0 l = 
Y(di t i + 1)5 where dj^ is the degree of the i-th initially infective vertex. Hence, letting x 
denote the fraction in (12.181) . so x ~ the probability is 

]^[Eexp(-y7r“ 1 a n r(di ii + 1)) + o(l) = JJ(Eexp(-x7r“ 1 a n F(fc + l)))' 1I ’ fc + o(l). (4.61) 

i k 

A calculation, see Appendix [UJ shows that if we define 

V’n(fc) : = log exp(ka n xnf 1 (x l3n/pn - dx, (4.62) 

interpreted as 0 when p n = 0, then this probability is 

exp(—^ rii^ n {k)] + o(l), 

k 
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(4.63) 

















thus generalizing (I2.18jh It is easily seen that if n (k) = 0(k 2 ctffj and thus ^2 k n i,k' l Pn(k) = 
0(a n di t * ^2 k n i,kka n ) — 0(a n di^ under our assumption a n Xi 0 = 0(1), which explains why 
the extra term in (14.6311 disappears in (12.18[) . 

Note also that if n {k) ^ 0 by Jensen’s inequality; thus an extremely uneven distribution of 
the degrees of the initially infective vertices will increase the probability of a small outbreak. 


Appendix A. The Sellke construction for network epidemics 

House et al. 0 introduced a method to simulate the final size of a network epidemic that 
drew on the constructions of Sellke 34]] and Ludwig [23] (the latter in its most general sense 
as described by Pellis et al. 30]). This approach allows us to make a realisation by drawing 
three sets of random numbers, and then we can consider multiple initial conditions and the 
final sizes they produce without drawing further random numbers. 

Let individuals be labelled i,j,... £ {1, ...,n}. Let them be connected by a general 
network with adjacency matrix with elements G t] , and let Z* be an indicator variable taking 
the value 1 if individual i is eventually infected during the epidemic and 0 otherwise. Our 
algorithm then proceeds as follows. 

First, for each individual i pick an infectious period T* ~ Exp(p). Secondly, for each 
individual i pick a threshold Qi ~ Exp(l). This represents the individual’s resistance to 
infection. Thirdly, a random permulation P of the integers {1,... ,n} is chosen. The first 
m elements of this permutation are then taken to be the indices of the initially infectious 
individuals, i.e. we initialise Z % ■<— 1 if * £ P and Z* ■£- 0 otherwise. Then to arrive at the 
correct final values we iteratively search for each individual i that has Zi = 0 and set Zi 4— 1 
if 

Qi</3J2 G ij T i z i • (A.l) 

3 

This procedure is continued until no changes occur on a given iteration, giving the final size 
for that value of m. We can then increase m and continue the iterative procedure, allowing 
simulations that are both computationally efficient, and for which the total final size Z is 
monotone non-decreasing in the initial number infected m. 


Appendix B. Critical behaviour of G(n, (dj)" =1 ) 

In this appendix, we show how the methods in this paper yield an improvement of [[ 20 S . 
Theorem 2.4] on the size of the largest component in G(n, (<Jj)” =1 ) near criticality, replacing 
the moment condition used in j20| 

OO 

J2k 4+V n k = 0(n) (B.l) 

k=0 

for some 7] > 0, by the uniform summability of YlkLo k^ n k hi assumption (JnJ below, cf. 

This is more or less best possible, since if the limiting vertex degree distribution does not 
have a finite third moment, then (at least in typical cases), the size of the largest component 
is o p (no; n ), see van der Hofstad, Janson and Luczak (in preparation). 

Theorem B.l. Suppose that the degree sequences (di)^ =1 satisfy the following conditions. 

(i) There is a probability distribution (pkfkL 0 such that n k /n — > p k for each k ^ 0. 

(ii) For all e > 0, there exists M such that, for all n, k z n k < en. 
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(D2) 












Then the 


(iii) Pi > 0. 

( iv ) EEo k(k-2)p k = Q. 

(v) a n := Efclo k ( k — 2 )nk/n satisfies n 1//3 a n —* cx). 

Define the positive constants A := Efclo ^Pk and, 7 := Efclo — l)(fc — 2)p fe . 
Lei C\ and C 2 denote the largest and second largest components of G(n, (dj)” =1 ) 
number of vertices in C\ is 

. _ . 2A 

v(Ci) = —na„ + o p (na n ), 

7 

i/ie number of vertices in C\ with degree k ^ 0 is 


tr \ 2 k P k , 1 \ 

Vkifii) = —na n + Op(na n ), 


and the number of edges in C\ is 


7 


2A 


e(Ci) = — na n + o p (na n ). 
7 


(B.2) 


(B.3) 


(B.4) 


In contrast, C 2 has only v(C 2 ) = o p (na n ) vertices and e(C 2 ) = o p (na n ) edges. 

Remark B.2. If k ( k ~ 2 )Pk = 0 then p\ > 0 is equivalent to po + Pi +p 2 < 1 (condition 
(D6)). As a consequence, 7 > 0. 


Sketch of proof. We will explain how to modify the argument of 20]. The latter is similar in 


spirit (and actually inspired) our proof of Theorem 12.31 in Section |3] Indeed, the algorithm 
used in [20[ to construct the multigraph G*(n, (fi ? -)” =] ) and explore its components is closely 
related to the time-changed epidemic with zero recovery rate, i.e. p n = 0. In more detail, 
we begin with all vertices susceptible (or sleeping in the terminology of 20]). We choose a 
single vertex at random and declare it infective (or active in j20|). The infection eventually 
spreads through the whole connected component containing the chosen vertex because p n = 
0. The connected component thus comprises the susceptible (sleeping) vertices that were 
infected (activated) during this time, together with the initially infective (active) vertex. 
The procedure can be repeated until all the connected components have been explored. 

In particular, each susceptible (sleeping) vertex of degree k ^ 0 is still infected (activated) 
at rate k in the algorithm of ]20|; in addition one vertex is activated at the start of each 
new component. The number of vertices that would be sleeping if we ignore the latter 
type of activations (denoted Vk(t) in [20j) thus evolves as the Markov death chain S t (k) 
considered in the proof of Theorem 13.11 but with So(k) = nk- One can prove concentration 
of measure for S t (k), YlkLo t(k) and YlkLo kSfik), as in Theorem 13.11 The analogous result 
in {3] is Lemma 6.3 and its proof involves a non-trivial use of assumption (IB. ID . So we 
must replace (El . Lemma 6.3] with Theorem 13.11 Note that the former result achieves an 
O p {n}^orr! 2 + na4) bound on the error, versus the o p (ndfi) bound of Theorem 13.II However, 
it can be checked that the o p (na, 2 J bound is sufficient for the rest of the proof. 

There is only one other place where 0 uses assumption (IB.ID non-trivially; that is 
to control the Taylor expansion remainder term in the analogue of (I3.5ip (immediately 
preceding equation (6.7) on page 212). But we may obtain an adequate bound with the 
argument leading to (I3.49D . 

All of the other modifications are trivial. □ 
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Appendix C. Proof of (14.631) 


Let c n := x^n 1 and n °te that c n = 0(1) by (14.ip and (12.20)) . If p n > 0, then by the 
definition of Y n {k ) before Lemma [4.31 and the substitution x = e~ PnT , 

Ee -c n a n Y(k+ 1) = j (^1 + (1 - e~ pnT ) ( e ~ Cnan - 1) j p n e~ pnT dr 

= j" (l - (l - x Pn/Pn ) (l - e ~ Cnan ) j dx 

1 - (l - x ^ n/pn ) (c n a n ( 1 + 0(a n ))) j dx 
= j exp(^—k(l — x l3n ^ pn )[c n a n + 0(aD)^jdx 

— e°( ka ^ J exp(c n a n k(x l3ri / pn — l) j dx 

= e -cnn„a„k+0(kal) f eX p (c n a n k {x^ n/p 


Pn 

/3n+pn 


) j dx 


— e -xank+ipn(k)+0(ka1) 


(C.l) 


If p n = 0, then 7r n = 1 and Y(k + 1) = k, and (10. 1 j) is trivial. 
We obtain (j4.63j) by using (IC.ip in (I4.6ip . 
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Figure 1. The relationship between epidemic final size and initial force of 
infection for 20 realisations of the network Scllkc construction for n = 10 5 
(top), n = 10 6 (middle) and n = 10' (bottom), p — 1, /3 — 1, and Poisson 
degree distribution with mean A = 2.02 leading to 1Z 0 = 1.01. 
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